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Abstract 

When securing complex infrastructures or large environments, constant surveil¬ 
lance of every area is not affordable. To cope with this issue, a common counter¬ 
measure is the usage of cheap but wide-ranged sensors, able to detect suspicious 
events that occur in large areas, supporting patrollers to improve the effectiveness 
of their strategies. However, such sensors are commonly affected by uncertainty. 
In the present paper, we focus on spatially uncertain alarm signals. That is, the 
alarm system is able to detect an attack but it is uncertain on the exact position 
where the attack is taking place. This is common when the area to be secured is 
wide such as in border patrolling and fair site surveillance. We propose, to the best 
of our knowledge, the first Patrolling Security Game model where a Defender is 
supported by a spatially uncertain alarm system which non-deterministically gen¬ 
erates signals once a target is under attack. We show that finding the optimal strat¬ 
egy in arbitrary graphs is AVX-hard even in zero-sum games and we provide two 
(exponential time) exact algorithms and two (polynomial time) approximation al¬ 
gorithms. Furthermore, we analyse what happens in environments with special 
topologies, showing that in linear and cycle graphs the optimal patrolling strategy 
can be found in polynomial time, de facto allowing our algorithms to be used in 
real-life scenarios, while in trees the problem is A/'T’-hard. Finally, we show that 
without false positives and missed detections, the best patrolling strategy reduces 
to stay in a place, wait for a signal, and respond to it at best. This strategy is opti¬ 
mal even with non-negligible missed detection rates, which, unfortunately, affect 
every commercial alarm system. We evaluate our methods in simulation, assessing 
both quantitative and qualitative aspects. 
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1. Introduction 


Security Games model the task of protecting physical environments as a non- 
cooperative game between a Defender and an Attacker [|T||. These games usually 
take place under a Stackelberg (a.k.a. leader-follower) paradigm where the 
Defender {leader) commits to a strategy and the Attacker (follower) first observes 
such commitment, then best responds to it. As discussed in the seminal work [jS], 
finding a leader-follower equilibrium is computationally tractable in games with 
one follower and complete information, while it becomes hard in Bayesian games 
with different types of Attacker. The availability of such computationally tractable 
aspects of Security Games led to the development of algorithms capable of scaling 
up to large problems, making them deployable in the security enforcing systems of 
several real-world applications. The first notable examples are the deployment of 
police checkpoints at the Los Angels International Airport 0 and the scheduling 
of federal air marshals over the U.S. domestic airline flights 1(51 . More recent case 
studies include the positioning of U.S. Coast Guard patrols to secure crowded 
places, bridges, and ferries (fill and the arrangement of city guards to stop fare 
evasion in Los Angeles Metro BTl . Finally, a similar approach is being tested and 
evaluated in Uganda, Africa, for the protection of wildlife [HI. Thus, Security 
Games emerged as an interesting game theoretical tool and then showed their on- 
the-field effectiveness in a number of real security scenarios. 

We focus on a specific class of security games, called Patrolling Security 
Games. These games are modelled as infinite-horizon extensive-form games in 
which the Defender controls one or more patrollers moving within an environ¬ 
ment represented as a discrete graph. The Attacker, besides having knowledge of 
the strategy to which the Defender committed to, can observe the movements of 
the patrollers at any time and use such information in deciding the most conve¬ 
nient time and target location to attack 0. When multiple patrollers are available, 
coordinating them at best is in general a hard task which, besides computational 
aspects, must also keep into account communication issues ifTOl . However, the pa¬ 
trolling problem is tractable, even with multiple patrollers, in border security (e.g., 
linear and cycle graphs), when patrollers have homogeneous moving and sensing 
capabilities and all the vertices composing the border share the same features ffTTlI . 
Scaling this model involved the study of how to compute patrolling strategies in 
scenarios where the Attacker is allowed to perform multiple attacks [fO . Simi¬ 
larly, coordination strategies among multiple Defenders are investigated in [|T3l . 
In [HH, the authors study the case in which there is a temporal discount on the 
targets. Extensions are discussed in lITSll . where coordination strategies between 
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defenders are explored, in IfT^ . where a resouree ean eover multiple targets, and 
in [fTTIl where attaeks ean deteeted at different stages with different assoeiated 
utilities. Finally, some theoretieal results about properties of speeifie patrolling 
settings are provided in ifTSlI . In the present paper, we provide a new model of 
patrolling seeurity games in whieh the Defender is supported by an alarm system 
deployed in the environment. 

1.1. Motivation scenarios 

Often, in large environments, a eonstant surveillanee of every area is not af¬ 
fordable while foeused inspeetions triggered by alarms are more eonvenient. Real- 
world applieations inelude UAVs surveillanee of large infrastruetures [|T9ll . wild¬ 
fires deteetion with CCD eameras ll20ll . agrieultural fields monitoring ll2Tll . and 
surveillanee based on wireless sensor networks ll2^ . and border patrolling [|2^ . 
Alarm systems are in praetiee affeeted by detection uneertainty, e.g., missed de- 
teetions and false positives, and localization (a.k.a. spatial) uneertainty, e.g., the 
alarm system is uneertain about the exaet target under attaek. We summarily de- 
seribe two praetieal seeurity problems that ean be aseribed to this eategory. We 
report them as examples, presenting features and requirements that our model ean 
properly deal with. In the rest of the paper we will neeessarily take a general 
stanee, but we eneourage the reader to keep in mind these two oases as referenee 
applieations for a real deployment of our model. 

1.1.1. Fight to illegal poaching 

Poaohing is a widespread environmental orime that oauses the endangerment 
of wildlife in several regions of the world. Its devastating impaot makes the de¬ 
velopment of surveillanee teohniques to oontrast this kind of aotivities one of the 
most important matters in national and international debates. Poaehing typioally 
takes plaoe over vast and savage areas, making it oostly and ineffeotive to solely 
rely on persistent patrol by ranger squads. To overeome this issue, reeent develop¬ 
ments have foeused on providing rangers with environmental monitoring systems 
to better plan their inspeetions, eoneentrating them in areas with large l ik elihood 
of spotting a erime. Sueh systems inelude the use of UAVs flying over the area, 
alarmed fenees, and on-the-field sensors trying to reeognize anomalous aetivi- 
tiesj^In all these eases, teehnologies are meant to work as an alarm system: onee 
the illegal aetivity is reeognized, a signal is sent to the rangers base station from 


'See, for example, http : / /wildlandsecurity. org/ 
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where a response is undertaken. In the great majority of eases, a signal eorre- 
sponds to a spatially uneertain loealization of the illegal aetivity. For example, a 
eamera-equipped UAV ean spot the presenee of a piekup in a forbidden area but 
eannot derive the aetual loeation to whieh poaehers are moving. In the same way, 
alarmed fenees and sensors ean only transmit the loeation of violated entranees or 
forbidden passages. In all these eases a signal implies a restrieted, yet not preeise, 
loealization of the poaehing aetivity. The use of seeurity games in this partieular 
domain is not new (see, for example, (Hll). However, our model allows the eom- 
putation of alarm response strategies for a given alarm system deployed on the 
field. This ean be done by adopting a diseretization of the environment, where 
eaeh target eorresponds to a seetor, values are related to the expeeted population 
of animals in that seetor, and deadlines represent the expeeted eompletion time of 
illegal hunts (these parameters ean be derived from data, as diseussed in [|8l). 

1.1.2. Safety affair sites 

Fairs are large publie events attended by thousands of visitors, where the prob¬ 
lem of guaranteeing safety for the hosting faeilities ean be very hard. For exam¬ 
ple, Expo 2015, the reeent Universal Exposition hosted in Milan, Italy, estimates 
an average of about 100,000 visits per day. This poses the need for earefully ad¬ 
dressing safety risks, whieh ean also derive from planned aet of vandalism or 
terrorist attaeks. Besides seeurity guards patrols, fair sites are often endowed with 
loeally installed monitoring systems. Expo 2015 employs around 200 baffle gates 
and 400 metal deteetors at the entranee of the site. The internal area is eonstantly 
monitored by 4,000 surveillanee eameras and by 700 guards. Eikely, when one or 
more of these deviees/personnel identify a seeurity breaeh, a signal is sent to the 
eontrol room together with a eireumseribed request of intervention. This approaeh 
is required beeause, espeeially in this kind of environments, deteeting a seeurity 
breaeh and neutralizing it are very different tasks. The latter one, in partieular, 
usually requires a greater effort involving speeial equipment and personnel whose 
employment on a demand basis is mueh more eonvenient. Moreover, the deteeting 
loeation of a threat is in many eases different from the loeation where it eould be 
neutralized, making the request of intervention spatially uneertain. Eor instanee, 
eonsider a seeurity guard or a surveillanee eamera deteeting the visitors’ reae- 
tions to a shooting rampage performed by some attaeker. In examples like these, 
we ean restriet the area where the seeurity breaeh happened but no preeise infor¬ 
mation about the loeation ean be gathered sinee the attaeker will probably have 
moved. Our model eould be applied to provide a poliey with whieh sehedule inter¬ 
ventions upon a seeurity breaeh is deteeted in some partieular seetion of the site. 
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In such case, targets eould eorrespond to buildings or other installations where 
visitors ean go. Values and deadlines ean be ehosen aeeording to the importanee 
of targets, their expeeted erowding, and the required response priority. 

1.2. Alarms and security games 

While the problem of managing a sensor network to optimally guard seeurity- 
eritical infrastrueture has been investigated in restricted domains, e.g. [[24ll . the 
problem of integrating alarm signals together with adversarial patrolling is almost 
eompletely unexplored. The only work that ean be classified under this seope 
is [|25ll . The paper proposes a skeleton model of an alarm system where sensors 
have no spatial uneertainty in detecting attacks on single targets. The authors anal¬ 
yse how sensory information ean improve the effeetiveness of patrolling strategies 
in adversarial settings. They show that, when sensors are not affeeted by false 
negatives and false positives, the best strategy preseribes that the patroller just 
responds to an alarm signal rushing to the target under attaek without patrolling 
the environment. As a eonsequenee, in sueh eases the model treatment beeomes 
trivial. On the other hand, when sensors are affected only by false negatives, the 
treatment ean be earried out by means of an easy variation of the algorithm for 
the ease without sensors dH. In the last ease, where false positives are admitted, 
the problem beeomes eomputationally intractable. To the best of our knowledge, 
no previous result dealing with spatial uneertain alarm signals in adversarial pa¬ 
trolling is present in the literature. 

Effeetively exploiting an alarm system and determining a good deployment 
for it (e.g., seleeting the loeation where install sensor) are eomplementary but 
radieally different problems. The results we provide in this work lie in the seope 
of the first one while the treatment of the seeond one is left for future works. 
In other words, we assume that a deployed alarm system is given and we deal 
with the problem of strategically exploiting it at best. Any approaeh to seareh 
for the optimal deployment should, in principle, know how to evaluate possible 
deployments. In such sense, our problem needs to be addressed before one might 
deal with the deployment one. 

1.3. Contributions 

In this paper, we propose the first Seeurity Game model that integrates a 
spatially uneertain alarm system in game-theoretie settings for patrolling]^ Eaeh 


very preliminary short version of the present paper is || 26 |. 
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alarm signal carries the information about the set of targets that ean be under at- 
taek and it is deseribed by the probability of being generated when eaeh target is 
attaeked. Moreover, the Defender ean eontrol only one patroller. The game ean 
be deeomposed in a finite number of finite-horizon subgames, eaeh ealled Signal 
Response Game from v (SRG-u) and eapturing the situation in whieh the De¬ 
fender is in a vertex v and the Attaeker attaeked a target, and an infinite-horizon 
game, ealled Patrolling Game (PG), in whieh the Defender moves in absenee of 
any alarm signal. We show that, when the graph has arbitrary topology, finding the 
equilibrium in eaeh SRG-w is ^'PA’-hard even in the zero-sum ease. We provide 
two exaet algorithms. The first one, based on dynamie programming, performs a 
breadth-first seareh, while the seeond one, based on braneh-and-bound approaeh, 
performs a depth-first seareh. We use the same two approaehes to design two ap¬ 
proximation algorithms. Furthermore, we provide a number of additional results 
for the SRG-n. We study speeial topologies, showing that there is a polynomial 
time algorithm solving a SRG-u on linear and eyele graphs, while it is A/'P-hard 
with trees. Then, we study the PG, showing that when no false positives and no 
missed deteetions are present, the optimal Defender strategy is to stay in a fixed 
loeation, wait for a signal, and respond to it at best. This strategy keeps being 
optimal even when non-negligible missed deteetion rates are allowed. We exper¬ 
imentally evaluate the sealability of our exaet algorithms and we eompare them 
w.r.t. the approximation ones in terms of solution quality and eompute times, in¬ 
vestigating in hard instanees the gap between our hardness results and the theoret- 
ieal guarantees of our approximation algorithms. We show that our approximation 
algorithms provide very high quality solutions even in hard instanees. Finally, we 
provide an example of resolution for a realistie instanee, based on Expo 2015, 
and we show that our exaet algorithms ean be applied for sueh kind of settings. 
Moreover, in our realistie instanee we assess how the optimal patrolling strategy 
eoineides with a statie plaeement even when allowing a false positive rate of less 
or equal to 30%. 

1.4. Paper structure 

In Seetionj^ we introduee our game model. In Seetionj^ we study the prob¬ 
lem of finding the strategy of the Defender for responding to an alarm signal in 
an arbitrary graph while in SeetionQ we provide results for speeifie topologies. 
In Seetion we study the patrolling problem. In Seetion we experimentally 
evaluate our algorithms. In Seotion]^ we briefly diseuss the main Seeurity Games 
researeh direetions that have been explored in the last deeades. Finally, Seetion 
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concludes the paper. [Appendix A includes a notation table, while [Appendix B 
reports some additional experimental results. 


2. Problem statement 


In this seetion we formalize the problem we study. More preeisely, in See- 


tion 2.1 we deseribe the patrolling setting and the game model, while in See¬ 


tion 2.2 we state the computational questions we shall address in this work. 


2.1. Game model 


Initially, in Seetion 2.1.1 we introduce a basie patrolling seeurity game model 


integrating the main features from models eurrently studied in literature. Next, in 


Seetion 2.1.2, we extend our game model by introducing alarm signals. In Sec¬ 
tion [2.1.3 we depiet the game tree of our patrolling seeurity game with alarm 
signals and we deeompose it in notable subgames to faeilitate its study. 


2.1.1. Basic patrolling security game 

As is eustomary in the artifieial intelligenee literature [(H HU, we deal with 
diserete, both in terms of spaee and time, patrolling settings, representing an ap¬ 
proximation of a continuous environment. Specifically, we model the environment 
to be patrolled as an undireeted eonnected graph G = (V, E), where vertices rep¬ 
resent different areas eonnected by various eorridors/roads, formalized through 
the edges. Time is diseretized in turns. We define T C V the subset of sensible 
vertices, called targets, that must be proteeted from possible attaeks. Each target 
f e T is eharaeterized by a value 7r(f) e (0,1] and a penetration time d{t) G N+ 
whieh measures the number of turns needed to eomplete an attack to t. 

Example 1 We report in Figure an example of patrolling setting. Here, V = 
{uo,ui,U 2 ,'y 3 ,'y 4 }, T = where ti = Vifori G {1, 2, 3,4}. A//the 

targets t present the same value Ti{t) and the same penetration time d{t). 

At each turn, an Attacker A and a Defender V play simultaneously having the 
following available aetions: 

• if A has not attaeked in the previous turns, it ean observe the position of T> in 
the graph Cj^and deeide whether to attack a target or to wait for a turn. The 


^Partial observability of A over the position of T> can be introduced as discussed in ED- 
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Figure 1; Example of patrolling setting. 

attack is instantaneous, meaning that there is no delay between the deeision 
to attack and the actual presence of a threat in the selected targej^ 

• V has no information about the aetions undertaken by A in previous turns 
and seleets the next vertex to patrol among those adjaeent to the eurrent one; 
eaeh movement is a non-preemptive traversal of a single edge (v, v') G E 
and takes one turn to be eompleted (along the paper, we shall use cc* to 
denote the temporal eost expressed in turns of the shortest path between any 
V and v' G V). 

The game may eonelude in eorrespondenee of any of the two following events. 
The first one is when V patrols a target t that is under attaek by A from less than 
d{t) turns. In sueh ease the attaek is prevented and A is eaptured. The seeond 
one is when target t is attaeked and V does not patrol t during the d{t) turns that 
follow the beginning of the attaek. In sueh ease the attaek is sueeessful and A 
eseapes without being captured. When A is captured, V reeeives a utility of 1 and 
A reeeives a utility of 0. When an attaek to t has success, V receives 1 — 7r(f) and 
A reeeives vr(f). The game may not eonelude if A deeides to never attaek (namely 
to wait for every turn). In sueh ease, V reeeives 1 and A reeeives 0. Notiee that the 
game is eonstant sum and therefore it is equivalent to a zero-sum game through 
an affine transformation. 

The above game model is in extensive form (being played sequentially), with 
imperfeet information {V not observing the aetions undertaken by A), and with 
infinite horizon {A being in the position to wait forever). The faet that A ean 


'*This is a worst-case assumption according to which A is as strong as possible. It can be 
relaxed by associating execution costs to the Attacker’s actions as shown in 1281 . 
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observe the aetions undertaken by V before acting makes the leader-follower 
equilibrium the natural solution concept for our problem, where V is the leader 
and A is the follower. Since we focus on zero-sum games, the leader’s strategy 
at the leader-follower equilibrium is its maxmim strategy and it can be found by 
employing linear mathematical programming, which requires polynomial time in 
the number of actions available to the players [|29l . 

2.1.2. Introducing alarm signals 

We extend the game model described in the previous section by introducing a 
spatial uncertain alarm system that can be exploited by V. The basic idea is that 
an alarm system uses a number of sensors spread over the environment to gather 
information about possible attacks and raises an alarm signal at any time an attack 
occurs. The alarm signal provides some information about the location (target) 
where the attack is ongoing, but it is affected by uncertainty. In other words, the 
alarm system detects an attack but it is uncertain about the target under attack. 
Formally, the alarm system is defined as a pair {S,p), where S = {si, • • • , 
is a set of m > 1 signals and p : S' x T —)■ [0,1] is a function that specifies the 
probability of having the system generating a signal s given that target t has been 
attacked. With a slight abuse of notation, for a signal s we define T(s) = {f G T | 
p{s I f) > 0} and, similarly, for a target t we have S{t) = {s e S \ p{s | f) > 0}. 
In this work, we assume that: 

• the alarm system is not affected by false positives (signals generated when 
no attack has occurred). Formally, p{s | A) = 0, where A indicates that no 
targets are under attack; 

• the alarm system is not affected by false negatives (signals not generated 
even though an attack has occurred). Formally, p(A | t) = 0, where A 
indicates that no signals have been generated; in Section we will show 
that the optimal strategies we compute under this assumption can preserve 
optimality even in presence of non-negligible false negatives rates. 

Example 2 We report two examples of alarm systems for the patrolling setting 
depicted in Figure^ The first example is reported in Figure ^a). It is a low- 
accuracy alarm system that generates the same signal anytime each target is un¬ 
der attack and therefore the alarm system does not provide any information about 
the target under attack. The second example is reported in Figure^b). It provides 
more accurate information about the localization of the attack than the previous 
example. Here, each target ti, once attacked, generates an alarm signal Si with 
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high probability and a different signal with low probability. That is, if alarm sig¬ 
nal Si has been observed, it is more likely that the attack is in target ti (given a 
uniform strategy of A over the targets). 



(a) Alarm system with a single signal for all the targets. 
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(b) Alarm system with multiple signals. 


Figure 2: Examples of alarm systems. 


Given the presence of an alarm system defined as above, the game mechanism 
changes in the following way. At each turn, before deciding its next move, V 
observes whether or not a signal has been generated by the alarm system and then 
makes its decision considering such information. This introduces in our game a 
node of chance implementing the non-deterministic selection of signals, which 
characterizes the alarm system. 

2.1.3. The game tree and its decomposition 

Here we depict the game tree of our game model, decomposing it in some 
recurrent subgames. A portion of the game is depicted in Figure Such tree can 
be read along the following steps. 

• Root of the tree. A decides whether to wait for a turn (this action is denoted 
by the symbol A since no target is under attack) or to attack a target G T 
(this action is denoted by the label ti of the target to attack). 
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Figure 3: Game tree, v is assumed to the be current vertex for P. r is a collapsed sequence of 
vertices, called route, we shall introduce in the next section. 
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• Second level of the tree. J\f denotes the alarm system, represented by a 
nature-type player. Its behavior is a priori speeified by the eonditional prob¬ 
ability mass funetion p whieh determines the generated signal given the at- 
taek performed by A. In partieular, it is useful to distinguish between two 
oases: 

(a) if no attaok is present, then no signal will be generated under the as¬ 
sumption that p(± I A) = 1; 

{b) if an attaek on target U is taking plaee, a signal s will be drawn from 
S{ti) with probability p(s | ti) (reeall that we assumed p(A | U) = 0). 

• Third level of the tree. V observes the signal raised by the alarm system and 
deoides the next vertex to patrol among those adjaoent to the ourrent one 
(the ourrent vertex is initially ohosen by V). 

• Fourth level of the tree and on. It is useful to distinguish between two oases: 

(a) if no attaek is present, then the tree of the subgame starting from here 
is the same of the tree of the whole game, exeept for the position of V 
that may be different from the initial one; 

(h) if an attaok is taking plaee on target F, then only V can aet. 

Suoh game tree ean be deoomposed in a number of finite reourrent subgames 
suoh that the best strategies of the agents in eaeh subgame are independent from 
those in other subgames. This deoomposition allows us to apply a divide et impera 
approaoh, simplifying the resolution of the problem of finding an equilibrium. 
More preoisely, we denote with T one of these subgames. We define T as a game 
subtree that ean be extraoted from the tree of Figure as follows. Given "D’s 
ourrent vertex v E V, seleet a deoision node for A and oall it i. Then, extraet the 
subtree rooted in i disearding the branoh oorresponding to aetion A (no attaek^ 


^Rigorously speaking, our definition of subgame is not compliant with the definition provided 
in game theory ll^ . which requires that all the actions of a node belong to the same subgame (and 
therefore we could not separate action A from actions ti). However, we can slightly change the 
structure of our game making our definition of subgame compliant with the one from game theory. 
More precisely, it is sufficient to split each node of A into two nodes: in the first A decides to 
attack a target or to wait for one turn, and in the second, conditioned to the fact that A decided to 
attack, A decides which target to attack. This way, the subtree whose root is the second node of A 
is a subgame compliant with game theory. It can be easily observed that this change to the game 
tree structure does not affect the behaviour of the agents. 
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Intuitively, such subgame models the players interaction when the Defender is in 
some given vertex v and the Attacker will perform an attack. As a consequence, 
each r obtained in such way is finite (once an attack on t started, the maximum 
length of the game is d{t)). Moreover, the set of different Fs we can extract is 
finite since we have one subgame for each possible current vertex for V, as a 
consequence we can extract at most \V\ different subgames. Notice that, due to 
the infinite horizon, each subgame can recur an infinite number of times along the 
game tree. However, being such repetitions independent and the game zero-sum, 
we only need to solve one subgame to obtain the optimal strategy to be applied 
in each of its repetitions. In other words, when assuming that an attack will be 
performed, the agents’ strategies can be split in a number of independent strategies 
solely depending on the current position of the Defender. The reason why we 
discarded the branch corresponding to action A in each subgame is that we seek 
to deal with such complementary case exploiting a simple backward induction 
approach as explained in the following. 

First, we call Signal Response Game from v the subgame F defined as above 
and characterized by a vertex v representing the current vertex of V (for brevity, 
we shall use SRG-u). In an SRG-f, the goal of V is to find the best strategy start¬ 
ing from vertex v to respond to any alarm signal. All the SRG-ns are independent 
one each other and thus the best strategy in each subgame does not depend on 
the strategies of the other subgames. The intuition is that the best strategies in an 
SRG-u does not depend on the vertices visited by V before the attack. Given an 
SRG-n, we denote by the strategy of V once observed signal s, by aj’ the 

strategy profile aj’ = • • •, strategy of A. Let us 

notice that in an SRG-u, given a signal s, V is the only agent that plays and there¬ 
fore each sequence of moves between vertices of V in the tree can be collapsed in 
a single action. Thus, SRG-u is essentially a two-level game in which A decides 
the target to attack and V decides the sequence of moves to visit the targets. 

Then, according to classical backward induction arguments [|30l, once we have 
found the best strategies of each SRG-u, we can substitute the subgames with the 
agents’ equilibrium utilities and then we can find the best strategy of V for pa¬ 
trolling the vertices whenever no alarm signal has been raised and the best strategy 
of attack for A. We call this last problem Patrolling Game (for conciseness, we 
shall use PG). We denote by and the strategies of V and A respectively in 
the PG. 
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2.2. The computational questions we pose 

In the present paper, we focus on some questions whose answers play a funda¬ 
mental role in the design of the best algorithms to find an equilibrium of our game. 
More precisely, we investigate the computational complexity of the following four 
problems. The first problem concerns the PG. 

Question 1 Which is the best patrolling strategy for V maximizing its expected 
utility? 

The other three questions concern SRG-n. For the sake of simplicity, we focus 
on the case in which there is only one signal s, we shall show that it is possible to 
scale linearly in the number of signals. 

Question 2 Given a starting vertex v and a signal s, is there any strategy of V 
that allows T> to visit all the targets in T{s), each within its deadline? 

Question 3 Given a starting vertex v and a signal s, is there any pure strategy of 
V giving V an expected utility of at least k? 

Question 4 Given a starting vertex v and a signal s, is there any mixed strategy 
ofV giving V an expected utility of at least k? 

In the following, we shall take a bottom-up approach answering the above 
questions starting from the last three and then dealing with the first one at the 
whole-game level. 


3. Signal response game on arbitrary graphs 

In this section we show how to deal with an SRG-w on arbitrary graphs. 
Specifically, in Section 3.1 we prove the hardness of the problem, analyzing its 
computational complexity. Then, in Section 3.2 and in Section 3.3 we propose 
two algorithms, the first based on dynamic programming (breadth-first search) 
while the second adopts a branch and bound (depth-first search) paradigm. Fur¬ 
thermore, we provide a variation for each algorithm, approximating the optimal 
solution. 
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3.1. Complexity results 

In this section we analyse SRG-v from a computational point of view. We 
initially observe that each SRG-n is characterized by |T| actions available to A, 
each corresponding to a target t, and by decision nodes of V. 

The portion of game tree played by V can be safely reduced by observing that 
V will move between any two targets along the minimum path. This allows us 
to discard from the tree all the decision nodes where V occupies a non-target 
vertex. However, this reduction keeps the size of the game tree exponential in 
the parameters of the game, specifically 0(|T|l^l)j^The exponential size of the 
game tree does not constitute a proof that finding the equilibrium strategies of an 
SRG-n requires exponential time in the parameters of the game because it does 
not exclude the existence of some compact representation of V’s strategies, e.g., 
Markovian strategies. Indeed such representation should be polynomially upper 
bounded in the parameters of the game and therefore they would allow the design 
of a polynomial-time algorithm to find an equilibrium. We show below that it is 
unlikely that such a representation exists in arbitrary graphs, while it exists for 
special topologies as we shall discuss later. 

We denote by the expected utility of A from SRG-u and therefore the ex¬ 
pected utility of "D is 1 — g^. Then, we define the following problem. 

Definition 1 (k-SRG-v) The decision problem k-SRG-v is defined as: 
INSTANCE: an instance ofSRG-v; 

QUESTION: is there any such that gv < k (when A plays its best response)? 

Theorem 1 k-SRG-v is strongly J\fV-hard even when [S'! = 1. 

Proof. Let us consider the following reduction from HAMILTONIAN-PATH [1311 . 
Given an instance of HAMILTONIAN-PATH Gh = (Vh, Eh), we build an in¬ 
stance for k-SRG-f as: 

.V = VhC {v}-, 

. E = EHG{{v,h),yheVH}; 

• T = Vh; 

• d{t) = \Vh\; 


more accurate bound is 0 (|T|™‘"fTl.maxt{d(t)}}^ 
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• n(t) G (0,1], for alH G T (any value); 

• S = {si}; 

• T(si) = T; 

• Pi^i I f) = for all t eT; 

• k = 0. 

If gs < 0, then there must exist a path starting from v and visiting all the targets 
in T by d = \Vh\- Given the penetration times assigned in the above construction 
and recalling that edge costs are unitary, the path must visit each target exactly 
once. Therefore, since T = Vh, the game’s value is less than or equal to zero if 
and only if Gh admits a Hamiltonian path. This concludes the proof. □ 

Notice that the problem of assessing the membership of k-SRG-v to MV 
is left open and it strictly depends on the size of the support of the strategy of 
a^. That is, if any strategy has a polynomially upper bounded support, then 
k-SRG-v is in J\fV. We conjecture it is not and therefore there can be optimal 
strategies in which an exponential number of actions are played with strictly pos¬ 
itive probability. Furthermore, the above result shows that with arbitrary graphs: 

• answering to Question 1 is J^MV-hard, 

• answering to Questions 2, 3, 4 is AfV-hard. 

As a consequence a polynomial-time algorithm solving those problems is unlikely 
to exist. In particular, the above proof shows that we cannot produce a compact 
representation (a.k.a. information lossless abstractions) of the space of strategies 
of V that is smaller than unless there is an algorithm better than the 

best-known algorithm for HAMILTONIAN-PATH. This is due to the fact that 
the best pure maxmin strategy can be found in linear time in the number of the 
pure strategies and the above proof shows that it cannot be done in a time less than 
(9(2l'^(^)l). More generally, no polynomially bounded representation of the space 
of the strategies can exist, unless V = MV. 

Although we can deal only with exponentially large representations of P’s 
strategies, we focus on the problem of finding the most efficient representation. 
Initially, we provide the following definitions. 

Definition 2 (Route) Given a starting vertex v and a signal s, a route (over the 
targets) r is a generic sequence of targets ofT{s) such that: 


16 



• r starts from v, 

• each target t G T(s) occurs at most once in r (but some targets may not 
occur), 

• r(i) is the i-th visited target in r (in addition, r(0) = v). 

Among all the possible routes we restriet our attention on a speeial elass of routes 
that we eall covering and are defined as follows. 

Definition 3 (Covering Route) Given a starting vertex v and a signal s, a route 
r is covering when, denoted by Ar{r{i)) = J2^h=o^r{h) r{h+i) needed by 

V to visit target t = r(i) G T(s) starting from r(0) = v and moving along 
the shortest path between each pair of consecutive targets in r, for every target t 
occurring in r it holds Ar(r(i)) < d(r(i)) (i.e., all the targets are visited within 
their penetration times). 

With a slight abuse of notation, we denote by T(r) the set of targets eovered 
by r and we denote by c(r) the total temporal eost of r, i.e., c(r) = Ar(r(|T(r)|)). 
Notice that in the worst case the number of covering routes is 0(|T(s)P*^*^l), but 
computing all of them may be unnecessary since some covering routes will never 
be played by V due to strategy domination and therefore they can be safely dis¬ 
carded [f3^ . More precisely, we introduce the following two forms of dominance. 

Definition 4 (Intra-Set Dominance) Given a starting vertex v, a signal s and 
two different covering routes r, r' such that T(r) = T(r'), if c(r) < c(r') then r 
dominates r'. 

Definition 5 (Inter-Set Dominance) Given a starting vertex v, a signal s and 
two different covering routes r, r', ifT(r) D T(r') then r dominates r'. 

Furthermore, it is convenient to introduce the concept of covering set, which 
is strictly related to the concept of covering route. It is defined as follows. 

Definition 6 (Covering Set) Given a starting vertex v and a signal s, a covering 
set Q is a subset of targets T(s) such that there exists a covering route r with 
T(r) = g. 

Let us focus on Definition It suggests that we can safely use only one route 
per covering set. Covering sets suffice for describing all the outcomes of the game, 
since the agents’ payoffs depend only on the fact that A attacks a target t that is 
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covered by V or not, and in the worst ease are with a remarkable redue- 

tion of the seareh spaee w.r.t. 0(|T(s)|l^^^^l). However, any algorithm restrieting 
on eovering sets should be able to determine whether or not a set of targets is a 
eovering one. Unfortunately, this problem is hard too. 

Definition 7 (COV-SET) The decision problem COV-SET is defined as: 
INSTANCE: an instance ofSRG-v with a target set T; 

QUESTION: is T a covering set for some covering route r? 

By trivially adapting the same reduetion for Theorem we ean state the fol¬ 
lowing theorem. 

Theorem 2 COV-SET is AfV-complete. 

Therefore, eomputing a eovering route for a given set of targets (or deeiding 
that no eovering route exists) is not doable in polynomial time unless V = AfV. 
This shows that, while eovering sets suffiee for defining the payoffs of the game 
and therefore the size of payoffs matrix ean be bounded by the number of eover- 
ing sets, in praetiee we also need eovering routes to eertifieate that a given subset 
of targets is eovering. Thus, we need to work with eovering routes, but we just 
need the routes eorresponding to the eovering sets, limiting the number of eover¬ 
ing routes that are useful for the game to the number of eovering sets. In addition. 
Theorem suggests that no algorithm for COV-SET ean have eomplexity better 
than unless there exists a better algorithm for HAMILTONIAN-PATH 

than the best algorithm known in the literature. This seems to suggest that enu¬ 
merating all the possible subsets of targets (eorresponding to all the potential eov¬ 
ering sets) and, for eaeh of them, eheeking whether or not it is eovering requires 
a eomplexity worse than 0(2l^(^)l). Surprisingly, we show in the next seetion that 
there is an algorithm with eomplexity 0(2l^(^)l) (negleeting polynomial terms) 
to enumerate all and only the eovering sets and, for eaeh of them, one eovering 
route. Therefore, the eomplexity of our algorithm matehes (negleeting polynomial 
terms) the eomplexity of the best-known algorithm for HAMILTONIAN-PATH. 

Let us foeus on Definition Inter-Set dominanee ean be leveraged to intro- 
duee the eoneept of maximal eovering sets whieh eould enable a further reduetion 
in the set of aetions available to T). 

Definition 8 (MAXIMAL COV-SET) Given a covering set Q (where Q = T(r) 
for some r), we say that Q is maximal if there is no route r' such that Q C T{r' ). 
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Furthermore, we say that r sueh that T(r) = Q is a maximal eovering route. In 
the best ease, when there is a route eovering all the targets, the number of maximal 
eovering sets is 1, while the number of eovering sets (ineluding the non-maximal 
ones) is Thus, eonsidering only maximal eovering sets allows an exponen¬ 

tial reduetion of the payoffs matrix. In the worst ease, when all the possible subsets 
eomposed of |T(s)|/2 targets are maximal eovering sets, the number of maximal 
eovering sets is while the number of eovering sets is al¬ 

lowing a reduetion of the payoffs matrix by a faetor of 2. Furthermore, if we knew 
a priori that Q is a maximal eovering set, we eould avoid searehing for eovering 
routes for any set of targets that strietly eontains Q. When designing an algorithm 
to solve this problem. Definition eould then be exploited to introduee some kind 
of pruning teehnique to save average eompute time. However, the following result 
shows that deeiding if a eovering set is maximal is hard. 

Definition 9 (MAX-COV-SET) The decision problem MAX-COV-SET is de¬ 
fined as: 

INSTANCE: an instance ofSRG-v with a target set T and a covering set T' C T; 
QUESTION: is T' maximal? 

Theorem 3 There is no polynomial-time algorithm for MAX-COV-SET unless 
V = MV. 

Proof. Assume for simplieity that S = {si} and that T(si) = T. Initially, we 
observe that MAX-COV-SET is in co-AfV. Indeed, any eovering route r sueh 
that T(r) D T' is a NO eertifieate for MAX-COV-SET, plaeing it in co-AfV. 
(Notiee that, trivially, any eovering route has length bounded by 0(|Tp); also, 
notiee that due to Theorem 2, having a eovering set would not suffiee given that we 
eannot verify in polynomial time whether it is aetually eovering unless V = AfV.) 

Eet us suppose we have a polynomial-time algorithm for MAX-COV-SET, 
ealled A. Then (sinee V C AfV fl co-AfV) we have a polynomial algorithm for 
the eomplement problem, i.e., deeiding whether all the eovering routes for T' are 
dominated. Eet us eonsider the following algorithm: given an instanee for COV- 
SET speeified by graph G = (V, E), a set of target T with penetration times d, 
and a starting vertex v: 

1. assign to targets in T a lexieographie order fi, ^ 2 , • • • , t\T\ ; 

2. for every t G T, verify if {f} is a eovering set in 0(|T|) time by eompar- 
ing ujft and d{t); if at least one is not a eovering set, then output NO and 
terminate; otherwise set T = {fi} and k = 2; 
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3. apply algorithm A on the following instance: graph G = (V, E), target set 
{TU{4}, d} (where disd restricted to TLi{tk}), start vertex v, and covering 
set T; 

4. if A’s output is YES (that is, T is not maximal) then set T = T U {tk}, 
k = k + 1 and restart from step 3; if A’s output is NO and k = |T| then 
output YES; if A’s output is NO and k < |T| then output NO; 

Thus, the existence of A would imply the existence of a polynomial algorithm for 
COV-SET which (under V ^ MV) would contradict Theorem 2. This concludes 
the proof. □ 

Nevertheless, we show hereafter that there exists an algorithm enumerating 
all and only the maximal covering sets and one route for each of them (which 
potentially leads to an exponential reduction of the time needed for solving the 
linear program) with only an additional polynomial cost w.r.t. the enumeration of 
all the covering sets. Therefore, neglecting polynomial terms, our algorithm has a 
complexity of 0(2l^^^)l). 

Einally, we focus on the complexity of approximating the best solution in an 
SRG-u. When V restricts its strategies to be pure, the problem is clearly not ap- 
proximable in polynomial time even when the approximation ratio depends on 
|T(s) I . The basic intuition is that, if a game instance admits the maximal covering 
route that covers all the targets and the value of all the targets is 1, then either 
the maximal covering route is played returning a utility of 1 to P or any other 
route is played returning a utility of 0, but no polynomial-time algorithm can find 
the maximal covering route covering all the targets, unless V = MV. On the 
other hand, it is interesting to investigate the case in which no restriction to pure 
strategies is present. We show that the problem keeps being hard. 

Theorem 4 The optimization version of k-SRG-v, say OPT-SRG-v, is AVPC- 
hard even in the restricted case in which the graph is metric, there is only one 
signal s, all targets t &T[s) have the same penetration time d{t), and there is the 
maximal covering route covering all the targets. 

Proof. We produce an approximation-preserving reduction from TSP(1,2) that is 
known to be AVX-hdsA O^ . Eor the sake of clarity, we divide the proof in steps. 

TSP(1,2) instance. An instance of undirected TSP(1,2) is defined as follows: 

• a set of vertices Vtsp, 

• a set of edges composed of an edge per pair of vertices. 
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• a symmetric matrix Ctsp of weights, whose values can be 1 or 2, each as¬ 
sociated with an edge and representing the cost of the shortest path between 
the corresponding pair of vertices. 

The goal is to find the minimum cost tour. Let us denote by OPTSOLtsp and 
OPTtsp the optimal solution of TSP(1,2) and its cost, respectively. Further¬ 
more, let us denote by APXSOLtsp and APXtsp an approximate solution of 
TSP(1,2) and its cost, respectively. It is known that there is no polynomial-time 
approximation algorithm with APXtsp/OPT apx < « for some a > 1, unless 

v = Mvm^. 

Reduction. We map an instance of TSP(1,2) to a specific instance of SRG-w 
as follows: 


• there is only one signal s. 


• T{s) = Vtsp, 

• = CTSp{t, t'), for every t, t' G T(s), 

• n{t) = 1, for every t G T(s), 


• w* f- = 1, for every t G T(s), 


• d{t) 


OPTtsp 
OPTtsp — 1 


if OPTtsp — 

if O PTtsp ^ 


, for every t eT{s). 

\Vtsp\ 


In this reduction, we use the value of OPTtsp even if there is no polynomial-time 
algorithm solving exactly TSP(1,2), unless V = AfV. We show below that with 
an additional polynomial-time effort we can deal with the lack of knowledge of 
OPTtsp- 

OPT-SRG-v optimal solution. By construction of the SRG-n instance, there is 
a covering route starting from v and visiting all the targets t G T(s) , each within its 
penetration time. This route has a cost of exactly d{t) and it is (v, ti,..., f|r(s)|)> 
where (fi,... ,t\T{s)\Ai) corresponds to OPTSOLtsp with the constraint that 
= 2 if OPTtsp > |Vtsp| (essentially, we transform the tour in a path 

I ^ ('®) I ’ f 

by discarding one of the edges with the largest cost). Therefore, the optimal so¬ 
lution of SRG-w, say OPTSOLsrg, prescribes to play the maximal route with 
probability one and the optimal value, say OPTsrg, is 1. 

OPT-SRG-v approximation. Let us denote by APXSOLsrg and APXsrg 
an approximate solution of OPT-SRG-n and its value, respectively. We assume 
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there is a polynomial-time approximation algorithm with APXsrg/OPTsrg > 
13 where (3 e (0,1). Let us notice that APXSOLsrg prescribes to play a polyno- 
mially upper bounded number of covering routes with strictly positive probability. 
We introduce a lemma that characterizes such covering routes. 

Lemma 5 The longest covering route played with strictly positive probability in 
APXSOLsrg visits at least /3|T(s)| targets. 

Proof. Assume by contradiction that the longest route visits /3\T{s) | — 1 targets. 
The best case in terms of maximization of the value of OPT-SRG-n is, due to 
reasons of symmetry (all the targets have the same value), when there is a set 
of \T{s)\ covering routes of length /)|T(s)| — 1 such that each target is visited 
exactly by l3\T{s)\ — 1 routes. When these routes are available, the best strategy 
is to randomize uniformly over the routes. The probability that a target is covered 
is /3 — and therefore the value of APXrrg is /) — This leads to a 
contradiction, since the algorithm would provide an approximation strictly smaller 
than 13. □ 

TSP(1,2) approximation from OPT-SRG-v approximation. We use the above 
lemma to show that we can build a (3—2/))-approximation for TSP(1,2) from a {3- 
approximation of OPT-SRG-n. Given an APXSOLsrg, we extract the longest 
covering route played with strictly positive probability, say ... ,f/ 3 |r(s)|)- 

The route has a cost of at most d{f), it would not cover /)|T(s)| targets other¬ 
wise. Any tour (fi,..., f/3|r(s)|, f/3|T(s)|-i-i, • • •, i\T{s)\Ai) has a cost not larger than 
d{t) — 1 + 2(1 — /))|T(s)| = OPTtsp — 1 + 2(1 — I3)\Vtsp\ (under the worst 
case in which all the edges in {ti 3 \T{s)\Ai 3 \T{s)\+i, ■ ■ ■ A\T{s)\,ti) have a cost of 2). 
Given that OPTrsp > |Vtsp|, we have that such a tour has a cost not larger 
than OPTtsp — 1 + 2(1 — I3)\Vtsp\ < OPTxspi^) — 2/3). Therefore, the tour 
is a (3 — 2/3)-approximation for TSP(1,2). Since TSP(1,2) is not approximable 
in polynomial time for any approximation ratio smaller than a, we have the con¬ 
straint that 3 — 2/3 > a, and therefore that (3 < Since a > 1, we have that 
< 1 and therefore that there is no polynomial-time approximation algorithm 
for OPT-SRG-n when [3 G 1), unless V = J\fV. 

OPTtsp oracle. In order to deal with the fact that we do not know OPTtsp, 
we can execute the approximation algorithm for OPT-SRG-n using a guess over 
OPTtsp- More precisely, we execute the approximation algorithm for every value 
in {\Vtsp\, • • •, 2|Vts'p|} and we return the best approximation found for TSP( 1,2). 
Given that OPTts p ^ {I Vts p I, • • •, 21 Vts p |}, there is an execution of the approx¬ 
imation algorithm that uses the correct guess. □ 
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We report some remarks to the above theorem. 

Remark 1 The above result does not exclude the existence of constant-ratio ap¬ 
proximation algorithms for OPT-SRG-v. We conjecture that it is unlikely. OPT- 
SRG-v presents similarities with the (metric) DEADLINE-TSP, where the goal 
is to find the longest path of vertices each traversed before its deadline. The 
DEADLINE-TSP does not admit any constant-ratio approximation algorithm /[34l/ 
and the best-known approximation algorithm has logarithmic approximation ra¬ 
tio / f35l/ . The following observations can be produced about the relationships be¬ 
tween OPT-SRG-v and DEADLINE-TSP: 

• when the maximal route covering all the targets in the OPT-SRG-v exists, 
the optimal solution of the OPT-SRG-v is also optimal for the DEADLINE- 
TSP applied to the same graph; 

• when the maximal route covering all the targets in the OPT-SRG-v does 
not exist, the optimal solutions of the two problems are different, even when 
we restrict us to pure-strategy solutions for the OPT-SRG-v; 

• approximating the optimal solution of the DEADLINE-TSP does not give 

a direct technique to approximate OPT-SRG-v, since we should enumerate 
all the subsets of targets and for each subset of targets we would need to 
execute the approximation of the DEADLINE-TSP, but this would require 
exponential time. We notice in addition that even the total number of sets 
of targets with logarithmic size is not polynomial, being (1^1))^ and 

therefore any algorithm enumerating them would require exponential time; 

• when the optimal solution of the OPT-SRG-v is randomized, examples of 
optimal solutions in which maximal covering routes are not played can be 
produced, showing that at the optimum it is not strictly necessary to play 
maximal covering routes, but even approximations suffice. 

Remark 2 If it is possible to map DEADLINE-TSP instances to OPT-SRG-v 
instances where the maximal covering route covering all the targets exists, then 
it trivially follows that OPT-SRG-v does not admit any constant-approximation 
ratio. We were not able to find such a mapping and we conjecture that, if there is 
an approximation-preserving reduction from DEADLINE-TSP to OPT-SRG-v, 
then we cannot restrict to such instances. The study of instances of OPT-SRG-v 
where mixed strategies may be optimal make the treatment very challenging. 


23 


3.2. Dynamic-programming algorithm 

We start by presenting two algorithms. The first one is exaet, while the see- 
ond one is an approximation algorithm. Both algorithms are based on a dynamie 
programming approaeh. 

2.2.1. Exact algorithm 

Here we provide an algorithm based on the dynamic programming paradigm 
returning the set of strategies available to V when it is in v and receives a signal 
s. The algorithm we present in this section enumerates all the covering sets and, 
for each of them, it returns also the corresponding covering route. Initially, we 
observe that we can safely restrict our attention to a specific class of covering sets, 
that we call proper, defined as follows. 

Definition 10 (Proper Covering Set) Given a starting vertex v and a signal s, a 
covering set Q is proper if there is a route r such that, once walked (along the 
shortest paths) over graph G, it does not traverse any target t E T(s)\ T(r). 

While in the worst case the number of proper covering sets is equal to the num¬ 
ber of covering sets (consider, for example, fully connected graphs with unitary 
edge costs) in realistic scenarios we expect that the number of proper covering 
sets is much smaller than the number of covering sets. As we show in Section 5, 
restricting to proper covering sets makes the complexity of our algorithm poly¬ 
nomial with respect to some special topologies: differently from the number of 
covering sets, the number of proper covering sets is polynomially upper bounded. 
Hereafter we provide the description of the algorithm. 

Let us denote ^ a collection of proper covering sets, where each set in this 
collection is denoted as The proper covering set Qj ^ has cardinality k and 
admits a covering route r whose starting vertex is v and whose last covered target 
is t. Each ^ is associated with a cost c{Q^ f) representing the temporal cost 
of the shortest covering route for ^ that specifies t as the k-th target to visit. 
Upon this basic structure, our algorithm iteratively computes proper covering sets 
collections and costs for increasing cardinalities, that is from k = 1 possibly up to 
k = |T(s)| including one target at each iteration. Using a dynamic programming 
approach, we assume to have solved up to cardinality k — 1 and we specify how 
to complete the task for cardinality k. Detailed steps are reported in Algorithm 
while in the following we provide an intuitive description. Given Qy~(^, we can 
compute a set of targets (Line[^ that is a subset of T(s) such that for each 
target f E Q~^ the following properties hold: 
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t' ^ Ql:t\ 


• if t' is appended to the shortest eovering route for Q^t^, it will be visited 
before d{t'), 

• the shortest path between t and t' does not traverse any target t” G T{s)\ 

Qlf- 

Funetion ShortestPath{G, t, t') returns the shortest path on G between t and 
t'. For effieieney, we ealeulate (in polynomial time) all the shortest paths offline by 
means of the Floyd-Warshall algorithm Obll . If Q+ is not empty, for eaeh t' G , 
we extend (Stepj^ by ineluding it and naming the resulting eovering set as 
sinee it has eardinality k and we know it admits a eovering route with last 
vertex t'. Sueh route ean be obtained by appending t' to the eovering route for 
Q^~t^ and has eost This value is assumed to be the eost of the 

extended proper eovering set.—In Step |9} we make use of a proeedure ealled 
Search{Q, G) where Q is a eovering set and C is a eolleetion of eovering sets. 
The proeedure outputs Q if Q G G and 0 otherwise. We adopted an effieient 
implementation of sueh proeedure whieh ean run in 0(|T(s)|). More preeisely, 
we represent a eovering set Q as a binary veetor of length |T(s)| where the i-th 
eomponent is set to 1 if target ti E Q and 0 otherwise. A eolleetion of eovering 
sets G ean then be represented as a binary tree with depth |T(s) |. The membership 
of a eovering set Q to eolleetion G is represented with a braneh of the tree in sueh 
a way that if U E Q then we have a left edge at depth i — 1 on sueh braneh. We 
ean easily determine if Q G C by eheeking if traversing a left (right) edge in the 
tree eaeh time we read a 1 (0) in Q’s binary veetor we reaeh a leaf node at depth 
|T(s)|. The insertion of a new eovering set in the eolleetion ean be done in the 
same way by traversing existing edges and expanding the tree where neeessary.— 
If sueh extended proper eovering set is not present in eolleetion G^^, or is already 


and[T^. After the iteration for eardinality k is eompleted, for eaeh proper eovering 
set Q in eolleetion G^^, c{Q) represents the temporal eost of the shortest eovering 
route with t as last target. 

After Algorithm [T] eompleted its exeeution, for any arbitrary T' C T we ean 
easily obtain the temporal eost of its shortest eovering route as 

c*{T') = min c{Q) 


present with a higher eost (StepH^, then eolleetion and eost are updated (Steps 11 
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Algorithm 1 DP-ComputeCovSets(v, s) 


1 : VteT(s),ke{2,...,IT(s)l}:Ci,= 


3 

4 

5 

6 

7 

8 

9 

10 
11 
12 

13 

14 

15 

16 

17 

18 


Vt e T(s}: c({i}) = 


for all fc e {2.. . |r(s)|} do 

for all t S T(s) do 




{i} if (v, t)eEii ^0 
0 othei-wise ’ 

, c(0) = oo 


for all 


6C" 


do 


Q+ = |t' e T{s) \ : (c(qS“^) + CJ* < d(f')) A ( fit" e T(s) \Q:t''e ShortestPath(G, t, f')) } 

for all t' S do 


U = Search{Ql^^,,Cl^,) 
ifc(i7)>c(QS;^i) + a;*j, then 

c(QS,,,) = c(Q„V)+<.,V 

end if 
end for 
end for 
end for 
end for 

return : t e T{s),k< |r(s)|} 


\n-i /1 

where Y\t'\ = Ut£T{Search(T',Cli-')} (notiee that if T' is not a eovering set 
then c*{T') = oo). For the sake of simplieity, Algorithm does not speeify how 
to earry out two sub-tasks we deseribe in the following. 

The first one is the annotation of dominated (proper) covering sets. Eaeh time 
Steps and are exeeuted, a eovering set is added to some eolleetion. Let us 
eall it Q and assume it has eardinality k. Eaeh time a new Q has to be ineluded at 
eardinality k, we mark all the eovering sets at eardinality k — 1 that are dominated 
by Q (as per Definition [^. The number of sets that ean be dominated is in the 
worst ease \Q\ sinee eaeh of them has to be searehed in eolleetion for eaeh 
feasible terminal t and, if found, marked as dominated. The number of terminal 
targets and the eardinality of Q are at most n and, as deseribed above, the Search 
proeedure takes 0(|T(s)|). Therefore, dominated (proper) eovering sets ean be 
annotated with a 0(|T(s)|^) extra eost at eaeh iteration of AlgorithmWe ean 
only mark and not delete dominated (proper) eovering sets sinee they ean generate 
non-dominated ones in the next iteration. 

The seeond task is the generation of routes. Algorithm foeuses on proper 
eovering sets and does not maintain a list of eorresponding routes. In faet, to build 
the payoffs matrix for SRG-v we do not strietly need eovering routes sinee eov¬ 
ering sets would suffiee to determine payoffs. However, we do need them opera- 
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tively since V should know in which order targets have to be covered to physically 
play an action. This task can be accomplished by maintaining an additional list of 
routes where each route is obtained by appending terminal vertex t' to the route 
stored for when set U {f'} is included in its corresponding collection. 
At the end of the algorithm only routes that correspond to non-dominated (proper) 
covering sets are returned. Maintaining such a list introduces a 0(1) cost. 

Theorem 6 The worst-case complexity of Algorithm^is 0(|T(s)|^2l^*^^^l) since 
it has to compute proper covering sets up to cardinality |T(s)|. With annotations 
of dominances and routes generation the whole algorithm yields a worst-case 
complexity o/0(|T(s)|^2l^(®)l). 

3.2.2. Approximation algorithm 

The dynamic programming algorithm presented in the previous section can¬ 
not be directly adopted to approximate the maximal covering routes. We notice 
that even in the case we introduce a logarithmic upper bound over the size of 
the covering sets generated by Algorithm we could obtain a number of routes 
that is 0(2^°^ (l^(s)l))^ and therefore exponential. Thus, our goal is to design a 
polynomial-time algorithm that generates a polynomial number of good covering 
routes. We observe that if we have a total order over the vertices and we work 
over the complete graph of the targets where each edge corresponds to the short¬ 
est path, we can find in polynomial time the maximal covering routes subject to 
the constraint that, given any pair of targets t, t' in a route, t can precede t' in the 
route only if t precedes t' in the order. We call monotonic a route satisfying a given 
total order. Algorithm [^returns the maximal mono tonic covering routes when the 
total order is lexicographic (trivially, in order to change the order, it is sufficient 
to re-label the targets). 

Algorithm]^ is based on dynamic programming and works as follows. R{k,l) 
is a matrix storing in each cell one route, while L{k,l) is a matrix storing in 
each cell the maximum lateness of the corresponding route, where the lateness 
associated with a target t is the difference between the (first) arrival time at t 
along r and d{t) and the maximum lateness of the route is the maximum lateness 
of the targets covered by the route. The route stored in R{k,l) is the one with the 
minimum lateness among all the monotonic ones covering I targets where tk is the 
first visited target. Thus, basically, when I = 1, R{k, 1) contains the route (n, tf), 
while, when I > 1, R(k, 1) is defined appending to i?(/c, 1) the best (in terms of 
minimizing the maximum lateness) route R{k'., I — 1) for every k' > k, in order to 
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satisfy the total order. The whole set of routes in R are returnedj^The eomplexity 
of Algorithm]^ is 0(|T(s) |^), exeept the time needed to find all the shortest paths. 


Algorithm 2 MonotonieLongestRoute(u 

,s) 

1: 

Vfc, fc' e {1, 2,... , \T(s)\}, R{k, k') = 0, L{k, k') = 

+00, Cj{{k) = 0, C^ik) = +00 

2: 

forallVfc e {|r(s)|, |T(s)| - 1,. .., 1} do 


3: 

forallVZe{l,2,...,|r(s)|}do 


4: 

if Z = 1 then 


5: 

R(k,l) = {v,tk) 


6: 

L{k,l) = - d{tk) 


7: 

else 


8: 

for all Zc' s.t. |T(s)| > k' > |r(s)| - Zs do 


9: 

Cfi(Zc') = {R{kp),R(k',l-l)) 


10 

: CL{k') = raa;K{L{k,l),w*^^ + w*^ 

fc' 1)1 

11 

: end for 


12 

: i = argminj{CiO)} 


13 

: ifCiO ) < Othen 


14 

: R(k,l)^CR{j) 


15 

: L{k,l)^CL{j) 


16 

: end if 


17 

: end if 


18 

: end for 


19 

: end for 


20 

: return R 



We use different total orders over the set of targets, eolleeting all the routes 
generated using eaeh total order. The total orders we use are (where ties are broken 
randomly): 

• inereasing order te* the rationale is that targets elose to v will be visited 
before targets far from v; 

• inereasing order dy^t' the rationale is that targets with short deadlines will 
be visited before targets with long deadlines; 


• inereasing order t — wl^: the rationale is that targets with short exeess 
time will be visited before targets with long exeess time. 


In addition, we use a sort of random restart, generating random permutations over 
the targets. 


Theorem 7 


Algorithm^provides an approximation with ratio f2(|7^^). 


^We notice that dominance can be applied to discard dominated routes. However, in this 
case, the improvement would be negligible since the total number of routes, including the non- 
dominated ones, is polynomial. 
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Proof sketch. The worst case for the approximation ratio of our algorithm occurs 
when the covering route including all the targets exists and each covering route 
returned by our heuristic algorithm visits only one target. In that case, the optimal 
expected utility of P is 1. Our algorithm, in the worst case in which Ti{t) = 1 for 
every target t, returns an approximation ratio It is straightforward to see 

that, in other cases, the approximation ratio is larger. □ 


3.3. Branch-and-bound algorithms 

The dynamic programming algorithm presented in the previous section essen¬ 
tially implements a breadth-first search. In some specific situations, depth-first 
search could outperform breadth-first search, e.g., when penetration times are 
relaxed and good heuristics lead a depth-first search to find in a brief time the 
maximal covering route, avoiding to scan an exponential number of routes as the 
breadth-first search would do. In this section, we adopt the branch-and-bound 
approach to design both an exact algorithm and an approximation algorithm. In 


particular, in Section 3.3.1 we describe our exact algorithm, while in Section 3.3.2 
we present the approximation one. 


3.3.1. Exact algorithm 

Our branch-and-bound algorithm (see Algorithm is a tree-search based 
algorithm working on the space of the covering routes and returning a set of cov¬ 
ering routes R. It works as follows. 

Initial step. We exploit two global set variables, CLmin and CLmax initially 
set to empty (Steps 1-2 of Algorithm [^. These variables contain closed covering 
routes, namely covering routes which cannot be further expanded without violat¬ 
ing the penetration time of at least one target during the visit. CLmax contains the 
covering routes returned by the algorithm (Step 8 of Algorithm [^, while CLmin 
is used for pruning as discussed below. The update of CLmin and CLmax is driven 
by Algorithmic as discussed below. Given a starting vertex v and a signal s, for 
each target t G T(s) such that w* ^ < d{t) we generate a covering route r with 
r(0) = V and r(l) = t (Steps 1-3 of Algorithm [C). Thus, V has at least one 
covering route per target that can be covered in time from v. Notice that if, for 
some t, such minimal route does not exist, then target t cannot be covered because 
we assume triangle inequality. This does not guarantee that A will attack t with 
full probability since, depending on the values vr, A could find more profitable to 
randomize over a different set of targets. The meaning of parameter p is described 
below. 
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Algorithm 3 Branch-and-Bound(v, s, p) 

1 - CLmax ^— 0 

2 . CLmiti ^— 0 

3: for all t S T(s) do 

4; if 10 * j < d{t) then 

5: Tree-Search([p ■ |T(s)|], (i), t)) 

6; end if 

7: end for 
8. return 


Route expansions. The subsequent steps essentially evolve on each branch 
according to a depth-first search with backtracking limited by p (Step 4 of Al¬ 
gorithm]^. The choice of p directly influences the behavior of the algorithm and 
consequently its complexity. Each node in the search tree represents a route r built 
so far starting from an initial route (w, t). At each iteration, route r is expanded by 
inserting a new target at a particular position. We denote with r+(g,p) the route 
obtained by inserting target q after the p-th target in r. Notice that every expansion 
of r will preserve the relative order with which targets already present in r will 
be visited. The collection of all the feasible expansions r+s (i.e., the ones that are 
covering routes) is denoted by and it is ordered according to a heuristic that 
we describe below. Algorithm]^ described below, is used to generate (Step 1 
of Algorithm]^. In each open branch (i.e., R^ ^ 0), if the depth of the node in the 
tree is smaller or equal to [p ■ |T(s) |] then backtracking is disabled (Steps 7-11 of 
Algorithm]^, while, if the depth is larger than such value, is enabled (Steps 5-6 
of Algorithm]^. This is equivalent to fix the relative order of the first (at most) 
[p ■ |T(s) |] inserted targets in the current route. In this case, with p = 0 we do not 
rely on the heuristics at all, full backtracking is enabled, the tree is fully expanded 
and the returned R is complete, i.e., it contains all the non-dominated covering 
routes. Route r is repeatedly expanded in a greedy fashion until no insertion is 
possible. As a result. Algorithm [^generates at most |T(s)| covering routes. 

Pruning. Algorithm]^ is in charge of updating CL min and CLmax each time 
a route r cannot be expanded and, consequently, the associated branch must be 
closed. We call CLmin the minimal set of closed routes. This means that a closed 
route r belongs to CLmin only if CLmin does not already contain another r' C r. 
Steps 1-6 of Algorithm [^implement such condition: first, in Steps 2-3 any route 
r' such that r' 3 r is removed from CLmin, then route r is inserted in CLmin- 
Routes in CLmin are used by Algorithm in Steps 2 and 6 for pruning during 
the search. More precisely, a route r is not expanded with a target q at position 
p if there exists a route r' G CLmin such that r' C R(q,p). This pruning rule 
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Algorithm 4 Tree-Search(fc, r) 

1: _R+= ^ Expand(r) 

2; if i?+ = 0 then 
3; Close(r) 

4; else 

5: if fc > 0 then 

6; Tree-Search(fc — 

7; else 

8; for ail r+ S iJ"*" do 

9: Tree-Search (0, r+) 

10: Close(r+) 

11: end for 

12: end if 

13: end if 


is safe since by definition if r' G CLmin, then all the possible expansions of r' 
are unfeasible and if r' C r then r can be obtained by expanding from r'. This 
pruning mechanism explains why once a route r is closed is always inserted in 
CL rain without chccking the insertion against the presence in CL min of a route r" 
such that r" C r. Indeed, if such route r" would be included in CLmin we would 
not be in the position of closing r, having r being pruned before by Algorithm]^ 
in Step 2 or Step 8. 

We use CLmax to maintain a set of the generated maximal closed routes. This 
means that a closed route r is inserted here only if CLmax does not already contain 
another r' such that r' L r. This set keeps track of closed routes with maximum 
number of targets. Algorithm maintains this set by inserting a closed route r 
in Step 12 only if no route r' 3 r is already present in CLmax- Once the whole 
algorithm terminates, CLmax contains the final solution. 


Algorithm 5 Close(r) 

1: for all r' S CLmin do 
2: if r C r' then 

3. CLmin — CLmin \ {r } 

4: end if 

5: end for 

6. CLmin — CLmin U {r} 

7: for all r' e CLmax do 
8: if r C r' then 

9: return 

10: end if 

11: end for 
12: CL 

max — CLmax U {r} 


Heuristic function, A key component of this algorithm is the heuristic func- 
tion that drives the search. The heuristic function is defined as hr : {T(s) \T(r)} x 


31 










{1... |T(r)|} —)■ Z, where hr(t',p) evaluates the eost of expanding r by inserting 
target t' after the p-th target of r. The basie idea, inspired by ll37l . is to adopt a 
eonservative approaeh, trying to preserve feasibility. Given a route r, let us define 
the possible forward shift of r as the minimum temporal margin in r between the 
arrival at a target t and d{t): 

PFS{r) = - Ar{t)) 

The extra mileage er{t',p) for inserting target t' after position p is the addi¬ 
tional traveling eost to be paid: 

er{t',p) = {Ar{r{f)) + -f ^**,^(^+ 1 )) - Ar{r{p + 1)) 

The advance time that sueh insertion gets with respeet to d{t') is defined as: 

arit'.p) = d{t') - {Ar{r{p)) + 

Finally, hr{t\p) is defined as: 

hr{f,p) = min{a^(f',p); {PFS{r) - er{f,p))} 

We partition the set T(s) in two sets Tught and Tiarge where t G Tught if d{f) < 
and t G Tiarge Otherwise (5 G M is a parameter). The previous inequality 
is a non-binding ehoiee we made to diseriminate targets with a tight penetration 
time from those with a large one. Initially, we insert all the tight targets and only 
subsequently we insert the non-tight targets. We use the two sets aeeording to the 
following rules (see Algorithm]^: 

• the insertion of a target belonging to Tught is always preferred to the inser¬ 
tion of a target belonging to Tiarge, independently of the insertion position; 

• insertions of f G Tught are ranked aeeording to h eonsidering first the inser¬ 
tion position and then the target; 

• insertions of f G Tiarge are ranked aeeording to h eonsidering first the target 
and then the insertion position. 

The rationale behind this rule is that targets with a tight penetration time should 
be inserted first and at their best positions. On the other hand, targets with a large 
penetration time ean be eovered later. Therefore, in this last ease, it is less impor¬ 
tant whieh target to eover than when to eover it. 

Theorem 8 Algorithm^with p = 0 an exact algorithm and has an exponential 
computational complexity since it builds a full tree of covering routes with worst- 
case size 0(|T(s)p(^)l). 
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Algorithm 6 Expand(r) 

1; 

if Aight ^ T{r) then 





2; 

for all q S Tught \ 

') do 









hr(q,Pq^) > hriq.Pq'^^'^) 


Pq = 

. .Pg } S.t. Vi e {1, 


r+ {q, ) is a covering route 






^ Pv' e CL,nin ■ r' C r+{q,p''q'') 

3: 

end for 





4: 


s.t. Vi S 






' p(l) 


5; 

II 

+ 

0^ 

where < 


= ■ ■ 





r(fe) 


6; 

end if 





7: 

if Tiarge ^ T{r) then 





8; 

for all u e Tiarge \ T{ 

r) do 









hr{qp\p) > hr(qP''\p) 


Qp = { 

.. Qp } s.t. Vi e {1, 


r"*" , p) is a covering route 






, Pr' &CLrnin -r' Qr+{q^P\p) 

9; 

end for 





10; 

P = , . . . , p 

G)} s.t. Vi S 







1 ^ 

(fc+i) _ r'b (qP\pW) 

11: 

_R+ = _R+ U 

^(fc+ 2 )^ _ where < • 

= ... 





1 r 

(K) _ r'^{q^\p^^'^) 

12; 

end if 





13; 

return H+ 






3.3.2. Approximation algorithm 

Since p determines the completness degree of the generated tree, we ean ex¬ 
ploit Algorithmj^tuning p to obtain an approximation algorithm that is faster w.r.t. 
the exaet one. 

In faet, when p < 1 eompleteness is not guaranteed in favour of a less com¬ 
putational effort. In this case, the only guarantees that ean be provided for eaeh 
eovering route r G CLmax, onee the algorithm terminates are: 

• no other r' G CLmax dominates r; 

• no other r' ^ CLmax sueh that r r' dominates r. Notiee this does not 
prevent the existenee of a route r" not returned by the algorithm that visits 
targets T(r) in a different order and that dominates r. 

When p is ehosen as (where fc G N is a parameter), the eomplexity of gen¬ 
erating eovering routes beeomes polynomial in the size of the input. We ean state 
the following theorem, whose proof is analogous to that one of Theorem]^ 
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Theorem 9 Algorithm with p = provides an approximation with ra¬ 
tio and runs in 0(|T(s)|^) given that heuristic hr can be computed in 

oms)f). 

3.4. Solving SRG-v 

Now we can formulate the problem of computing the optimal signal-response 
strategy for V. Let us denote with cr^^s(r) the probability with which V plays route 
r under signal s and with the set of all the routes available to V generated by 
some algorithm. We introduce function Uj^(r, t), representing the utility function 
of A and defined as follows: 


UA{r,t) 


7i(t) ift^r 
0 otherwise 


The best V strategy (i.e., the maxmin strategy) can be found by solving the 
following linear mathematical programming problem: 


min Qy s.t. 

E p{s\t) E (^vA^WAirR) < Qv yteT 

s£S{t) r&Rr,s 

E = 1 Vs e ^ 

reRu.s 

>0 Vr e Ry^s, s e S 

The size of the mathematical program is composed of |T| + [S'! constraints 
(excluded > 0 constraints) and 0(|1/|[S'! max^^s{|-Rt;,s|}) variables. This shows 
that the hardness is due only to max„ which, in its turn, depends only 

on |T(s) I. We provide the following remark. 

Remark 3 We observe that the discretization of the environment as a graph is as 
accurate as the number of vertices is large, corresponding to reduce the size of 
the areas associated with each vertex, as well as to reduce the temporal interval 
associated with each turn of the game. Our algorithms show that increasing the 
accuracy of the model in terms of number of vertices requires polynomial time. 


4. SRG-u on special topologies 


In this section, we focus on special topologies, showing in Section 4.1 the 
topologies with which solving a SRG-u is computationally easy, those that are 
hard in Section |4.2[ and the topologies for which the problem remains open in 
Section |431 
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4.1. Easy topologies 

In this section, we show that, with some special topologies, there exists an 
efficient algorithm to solve exactly the SRG-w. 

Let us consider a linear graph. An example is depicted in Figure We state 
the following theorem. 



Figure 4; Linear graph. 


Theorem 10 There is a polynomial-time algorithm solving OPT-SRG-v with lin¬ 
ear graphs. 

Proof. We show that Algorithm requires polynomial time in generating all the 
pure strategies of V. The complexity of Algorithm once applied to a given 
instance, depends on the number of proper covering sets (recall Definition [T^. 
It can be shown that linear graphs have a polynomial number of proper covering 
sets. Given a starting vertex v, any proper covering set Q can be characterized by 
two extreme targets of Q, the first being the farthest from v on the left of v (if 
any, and v otherwise) and the second being the farthest from v on the right of v 
(if any, and v otherwise). For example, see Figure]^ given proper covering set 
Q = {fi, ^ 2 , fs}, the left extreme is ti and the right extreme is t^,. Therefore, the 
number of proper covering sets for each pair w, s is 0(|T(s)p). Since the actions 
available to V are polynomially upper bounded, the time needed to compute the 
maxmin strategy is polynomial. □ 

Let us consider a cycle graph. An example is depicted in Figure We can 
state the following theorem. 

Theorem 11 There is a polynomial-time algorithm solving OPT-SRG-v with cy¬ 
cle graphs (perimeters). 

Proof. The proof is analogous to that one of linear graphs. That is, each proper 
covering set can be characterized by two extremes: the left one and the right one. 
For example, see Figurej^ given proper covering set Q = {fi, ^ 2 , Ia, ^ 5 ,^ 6 , li}, the 
left extreme is and the right extreme is t 2 . As in linear graphs, the number of 
proper covering sets in a cycle graph isO(|T(s)p). □ 

The above results can be generalized to the case of tree graphs where the 
number of leaves is fixed. We can state the following theorem. 
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Theorem 12 There is a polynomial-time algorithm solving OPT-SRG-v with 
tree graphs where the number of leaves is fixed. 

Proof. The proof is analogous to those of linear and cycle graphs. Here, each 
proper covering set can be characterized by a tuple of extremes, one for each path 
connecting n to a leaf. The number of proper covering sets is 0(|T(s) |") where n 
is the number of leaves of the tree. □ 

The above results show that Questions 2-4 are solvable in polynomial time 
with the above special topologies. We show in the next section that when the 
number of leaves in a tree is not fixed, the problem becomes hard. Finally, we 
provide a remark to the above theorem. 


Remark 4 We already showed that, given an arbitrary topology, scaling the graph 


by introducing new vertices is possible with a polynomial-time cost. Theorem 12 


shows that with tree-based graphs this holds even when we introduce new targets. 


4.2. Hard topologies 

Let us consider a special topology, as shown in Figure and defined in the 
following. 

Definition 11 (S2L-STAR) Special 2-level star graph instances (S2L-STAR) are: 
• V = {uo, Vi,V 2 ,. ■ ■, V 2 n\, where Vq is the starting position; 
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• T = V\ {vo}. where vertices Vi with i G {1,..., n} are called inner targets, 
while vertices Vi with i G {n + 1,..., 2n} are called outer targets; 

• E = {(no,ni), {vi,Vn+i) : Vi G {1,...,«}} and we call i-th branch the 
pair of edges {{vo,Vi), {vi,Vn+i)); 

• travel costs are c{vo, Vi) = c{vi, Vn+i) = 'll for every i G {1,..., n}, where 
7i e N+; 


• penetration times are, for i G {1,..., n}, d{f) 
where H = 

• Tiif) = 1 for every t E T. 


6H — 3% t = Vi 
10-/V 2^^, t 


Initially, we show a property of S2L-STAR instances that we shall use below. 


Lemma 13 If an instance of S2L-STAR admits a maximal covering route r that 
covers all the targets, then the branches can be partitioned in two sets Ci and C 2 
such that: 


• all the branches in Ci are visited only once while all the branches in C 2 are 
visited twice, and 


• Y. ii= Y. ii = H. 

ieCi ieC2 

Proof Initially, we observe that, in a feasible solution, the visit of a branch can 
be of two forms. If branch i is visited once, then V will visit the inner target before 
time 6H — 87 * and immediately after the outer target. Ci denotes the set of all the 
branches visited according to this form. If branch i is visited twice, then V will 
visit at first the inner target before time 6H — 87 *, coming back immediately after 
to Vo, and subsequently in some time after 6H — 87 *, but before 10 H — 2yi, V will 
visit again the inner target and immediately after the outer target. C 2 denotes the 
set of all the branches that are visited according to this form. All the other forms of 
visits (e.g., three or more visits, different order visits, and visits at different times) 
are useless and any route in which some branch is not neither in Ci nor in C 2 can 
be modified such that all the branches are either in Ci or in C 2 strictly decreasing 
the cost of the solution as follows: 


• if branch i is visited only once and the visit of the inner target is after time 
OH — 87 *, then the solution is not feasible; 
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Figure 6: Special 2-level star graph. 


• if branch i is visited twice and the first visit of the inner target is after time 
6H — S'ji, then the solution is not feasible; 

• if branch i is visited twice and the second visit of the inner target is before 
time 6H — 87 *, then the first visit of the branch can be omitted saving 277 

• if branch i is visited twice and the outer target is visited during the first visit, 
then the second visit of the branch can be omitted saving > 27 *; 

• if branch i is visited three or more times, all the visits except the first one in 
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which the inner target is visited and the first one in whieh the outer target is 
visited ean be omitted saving > 27 *. 

We assume that, if there is a maximal eovering route r that eovers all the 
targets, then the visits are sueh that Ci U C 2 = {1,..., n} and therefore that eaeh 
braneh is visited either onee or twiee as diseussed above. We show below that in 
S2L-STAR instanees sueh an assumption is always true. Sinee r eovers all the 
targets, we have that the following eonditions are satisfied: 

+ ( 1 ) 

i&C2 ieCi 

6 X] + 4 ^ 7i < lOif (2) 

ieC2 ieCi 

Constraint Q requires that the eost of visiting entirely all the branehes in Ci 
and partially (only the inner target) all the branehes in C 2 is not larger than the 
penetration times of the inner targets. Notiee that this holds only when the last 
inner target is first-visited on a braneh in Ci. We show below that sueh assumption 
is always verified. Constraint Q requires that the eost of visiting entirely all the 
branehes in Ci and at first partially and subsequently entirely all the branehes in 
C 2 is not larger than the penetration times of the outer targets. We ean simplify 
the above pair of eonstraints as follows: 


7i +2 ^ 7j < <6H 

i&C2 ieCi ieCi 

'-V-' 

4H 

2X]7i + 4^7i + 4^7*< lOif 

i&C2 i&C2 ieCi 

'-V-' 

8H 

obtaining: 


ieCi 


ieC2 


sinee, by definition, 7 * + SieC 2 7 * ^ foUows that: 

i&Ci ieC2 
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Therefore, if r eovers all the targets and it is such that all the branches belong 
either to Ci or to C 2 , we have that r visits the last outer target exactly at its pen¬ 
etration time. This is because Constraints Q and ([^ hold as equalities. Thus, as 
shown above, in any route in which a branch is not neither in Ci nor in C 2 we can 
change the visits such that all the branches are in either Ci or C 2 , strictly reducing 
the total cost. It follows that no route with at least one branch that is not neither 
in Cl nor in C 2 can have a total cost equal to or smaller than the penetration time 
of the outer targets. Similarly, from the above equality it follows that any solution 
where the last inner target is first-visited on a C 2 branch can be strictly improved 
by moving such branch to Ci and therefore no route in which the last inner target 
is first-visited on a C 2 branch can have a total cost equal to or smaller than the 
penetration time of the outer targets. □ 

Definition 12 (PARTITION) The decision problem PARTITION is defined as: 
INSTANCE: A finite set I = {1,2,... ,1}, a size a* G N’*' for each i E I, and a 
bound B G N’*' such that = 25. 

QUESTION: Is there any subset I' C I such that = J2iei\r = B? 

We can now state the following theorem: 

Theorem 14 k-SRG-v is NV-hard even when restricted to S2L-STAR instances. 

Proof. We provide a reduction from PARTITION that is known to be weakly J\fV- 
hard. For the sake of clarity, we divide the proof in steps. 

Reduction. We map an instance of PARTITION to an instance of /c-SRG-n on 
S2L-STAR graphs as follows 

• S = {s}, 

• n = I (i.e., the number of branches in S2L-STAR equals the number of 
elements in PARTITION); 

• 'll = CLi for every i E I; 

• H = B, 

• k = 0. 

The rationale is that there is a feasible solution for PARTITION if and only if 
there is the maximal covering route that covers all the targets in a k-SRG-v on a 
S2L-STAR graph. 
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If. From Lemma we know that, if there is the maximal covering route that 
covers all the targets in a k-SRG-v on a S2L-STAR graph, then the branches 
can be partitioned in two sets Ci, C 2 such that = if. By 

construction 7* = a* and H = B. So, if there is the maximal covering route that 
covers all the targets in a k-SRG-v on a S2L-STAR graph, then there is partition 
of set I in two subsets I' = Ci and I” = C 2 such that ^ ^ 

H = B = Yli£C2 ~ 

Only if. If PARTITION admits a feasible solution, then, once assigned /' = Ci 
and I" = C 2 , it is straightforward to see that the route visits all the targets by their 
penetration times and therefore that the route is a maximal covering route. □ 

Let us notice that the above reduction, differently from that of Theorem[^ does 
not exclude the existence of an FPTAS, i.e.. Fully Polynomial Time Approxima¬ 
tion Scheme. This may hold since PARTITION admits an FPTAS. Furthermore, 
we observe that S2L-STAR graphs are special kinds of trees and therefore k- 
SRG-u on trees is A/'P-hard. Finally, we observe that the above result shows that 
it is unlikely that there is a polynomial-time algorithm solving Questions 1-4. 

4.3. Borderline topologies 

Let us consider a star graph, as shown in Figure |7} defined as follows. 

Definition 13 (SIMPLE-STAR) Simple star graph instances (SIMPLE-STAR) 
are: 


• V = {uo, vi,V 2 ,. ■ ■ ,Vn}, where uq is the starting vertex ofV; 
. T = V\ {no}; 

• E = {(uo,Ui),Vi e ( 1 ,... ,n}}; 

• travel costs are c(vo, Vi) = 7 *, where 7 ^ G N’*'; 

• penetration times di and values 7 r(ui) can be any. 
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Figure 7: Star graph. 


We can state the following theorem. 

Theorem 15 If the maximal covering route r covering all the targets exists, the 
Earliest Due Date algorithm returns r in polynomial time once applied to SIMPLE- 
STAR graph instances. 

Proof. The Earliest Due Date ll^ (EDD) algorithm is an optimal algorithm for 
synchronous (i.e., without release times) aperiodic scheduling with deadlines. 
It executes (without preemption) the tasks in ascending order according to the 
deadlines, thus requiring polynomial complexity in the number of tasks. Any 
SIMPEE-STAR graph instance can be easily mapped to a synchronous aperiodic 
scheduling problem: each target ti is an aperiodic task Jj, the computation time 
of Ji is equal to 27 ^, the deadline of task Jj is d{ti) + 7 *. It is straightforward to 
see that, if EDD returns a feasible schedule, then there is the maximal covering 
route, and, if EDD returns a non-feasible schedule, then there is not any maximal 
covering route. □ 

The above result shows that Question 2 can be answered in polynomial time. 
We show that also Question 3 can be answered in polynomial time be means of a 
simple variation of EDD algorithm. 

Theorem 16 Given a signal s, the best pure strategy ofV in an SRG-v game on 
SIMPLE-STAR graph instances can be found in polynomial time. 

Proof. Given a signal s, the algorithm that finds the best pure strategy is a variation 
of the EDD algorithm. Eor the sake of clarity, we describe the algorithm in the 
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simplified case in which there is only one signal s. The extension to the general 
case is straightforward. The algorithm works as follows: 

1. apply EDD, 

2 . if the maximal covering route exists, then return it, 

3. else remove the target t with the smallest 7r(f) from T(s), 

4. go to Point 1. 

Essentially, the algorithm returns the subset of targets admitting a covering route 
minimizing the maximum value among all the non-covered targets. □ 

Although the treatment of SIMPEE-STAR graph instances in pure strategies 
is computationally easy, it is not clear if the treatment keeps being easy when V 
is not restricted to play pure strategies. We just observe that Algorithm 1 requires 
exponential complexity, the number of proper covering sets being exponential. 
Thus, the complexity of solving Questions 1 and 4 remains unaddressed. 


5. Patrolling game 


In this section, we focus on the PG. Specifically, in Section 5.1 we state our 
main result showing that patrolling is not necessary when an alarm system is 
present, in Section we propose the algorithm to deal with the PG, in Sec¬ 
tion 15^ we summarize the complexity results about Questions 1-4. 


5.1. Standstill 

We focus on the problem of finding the best patrolling strategy given that we 
know the best signal-response strategy for each vertex v in which V can place. 
Given the current vertex of V and the sequence of the last, say n, vertices visited 
by V (where n is a tradeoff between effectiveness of the solution and computa¬ 
tional effort), a patrolling strategy is usually defined as a randomization over the 
next adjacent vertices flUl. We define v* = argmin^g^j^f^}, where is the value 
returned by the optimization problem described in Section 3.3, as the vertex that 
guarantees the maximum expected utility to V over all the SRG-us. We show that 
the maxmin equilibrium strategy in PG prescribes that V places at v*, waits for a 
signal, and responds to it. 


Theorem 17 Without false positives and missed detections, if'it G T we have 
that |S'(f)| > 1, then any patrolling strategy is dominated by the placement in v*. 
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Proof. Any patrolling strategy different from the placement in v* should neces¬ 
sarily visit a vertex v' 7 ^ v*. Since the alarm system is not affected by missed 
detections, every attack will raise a signal which, in turn, will raise a response 
yielding an utility of where x is the current position of V at the moment 
of the attack. Since A can observe the current position of V before attacking, 
X = arg max.j,gp{ 5 (^} where P is the set of the vertices patrolled by V. Obviously, 
for any P D {n*} we would have that > gv* and therefore placing at v* and 
waiting for a signal is the best strategy for V. □ 

The rationale is that, if the patrolling strategy of V prescribes to patrol a set 
of vertices, say V', then, since A can observe the position of V, the best strategy 
of A is to wait for V being in v' = argmax^gy,{ 5 (^} and then to attack. Thus, by 
definition of g^*, if V leaves v* to patrol additional vertices the expected utility it 
receives is no larger than that it receives from staying in v*. 

A deeper analysis of Theorem [T7] can show that its scope does include cases 
where missed detections are present up to a non-negligible extent. For such cases, 
placement-based strategies keep being optimal even in the case when the alarm 
systems fails in detecting an attack. We encode the occurrence of this robustness 
property in the following proposition, which we shall prove by a series of exam¬ 
ples. 

Proposition 1 There exist Patrolling Games where staying in a vertex, waiting for 
a signal, and responding to it is the optimal patrolling strategy for V even with a 
missed detection rate a = 0.5. 

Proof The expected utility for V given by the placement in v* is (1 — a)(l — g^*), 
where (1 — a) is the probability with which the alarm system correctly generates 
a signal upon an attack and (1 — denotes V's payoff when placed in u*. A 
non-placement-based patrolling strategy will prescribe, by definition, to move 
between at least two vertices. From this simple consideration, we observe that 
an upper bound to the expected utility of any non-placement strategy is entailed 
by the case where V alternately patrols vertices v* and v^, where is the second 
best vertex in which V can statically place. Such scenario gives us an upper bound 
over the expected utility of non-placement strategies, namely 1 — g.^,*. It follows 
that a sufficient condition for the placement in v* being optimal is given by the 
following inequality: 




( 3 ) 
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To prove Proposition it then suffiees to provide a Patrolling Game instanee 
where Equationj^holds under some non-null missed deteetion rate a. In Fig.j^a) 
and Fig.j^b), we report two of sueh examples. The depleted settings have unitary 
edges exeept where explieitly indieated. For both, without missed deteetions, the 
best patrolling strategy is a plaeement v* = 4. When allowing missed deteetions, 
in Fig. j^a) it holds that = 0 and g^* = 0.75, where v* = A and = 1. Thus, 
by Equation!^ placement v* = A is the optimal strategy for a < 0.25. Under the 
same reasoning scheme, in Fig.j^b) we have that gy* = 0 and gy* = 0.5, making 
the placement v* = 4 optimal for any a < 0.5. □ 
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(a) Equation]^ holds for a < 0.25. 
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(b) Equation]^ holds for a < 0.5. 


Eigure 8; Two examples proving Proposition [T] 


It is reasonable to expect that a similar result holds also for the case with 
false positives. However, dealing with false positives is much more intricate than 
handling false negative and requires new models, e.g., V could respond to an 
alarm signal only with a given probability and with the remaining probability 
could stay in the current vertex. For this reason, we leave the treatment of false 
positives and a more accurate treatment of false negatives to future works. 
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5.2. Computing the best placement 


Under the absence of false positives and missed detections, Theorem 17 


srm- 

plifies the computation of the patrolling strategy by reducing it to the problem of 
finding v*. To such aim, we must solve a SRG-u for each possible starting vertex 
V and select the one with the maximum expected utility for V. Algorithm|^depicts 
the solving algorithm. Function SolveSRG{v) returns the optimal value 1 — gv*- 
The complexity is linear in \ V\, once has been calculated for every v. 


Algorithm 7 BestPlacement(G, s) 

1: U(v) <— 0 for every v & V 
2: for all u g V do 
3: U{v) ^ SolveSRG{v) 

4; end for 
5: return max ([7) 


Since all the vertices are possible starting points, we should face this hard 
problem (see Theorem \V\ times, computing, for each signal, the covering 
routes from all the vertices. To avoid this issue, we ask whether there exists an 
algorithm that in the worst case allows us to consider a number of iterations such 
that solving the problem for a given starting vertex v could help us finding the 
solution for another starting vertex v'. In other words, considering a specific set 
of targets, we wonder whether a solution for COV-SET with starting vertex v 
can be used to derive, in polynomial time, a solution to COV-SET for another 
starting vertex v'. This would allow us to solve an exponential-time problem only 
once instead of solving it for each vertex of the graph. To answer this question, 
we resort to hardness results for reoptimization, also called locally modified prob¬ 
lems [|S9l- We show that, even if we know all the covering routes from a starting 
vertex, once we changed the starting vertex selecting an adjacent one, finding the 
covering routes from the new starting vertex is hard. 

Definition 14 (LM-COV-ROUTE) A locally modified covering route (LM-COV- 
ROUTE) problem is defined as follows: 

INSTANCE: graph G = (U, E), a set of targets T with penetration times d, two 
starting vertices Vi and V 2 that are adjacent, and a covering route ri with ri(0) = 
Vi such thatT{ri) = T. 

QUESTION: is there r 2 with r2(0) = V 2 and Tfg) = T? 

Theorem 18 lm-COV-ROUTE is fifV-complete. 
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Proof. We divide the proof in two steps, membership and hardness. 

Membership. Given a YES eertifieate eonstitutes by a route, the verifieation is 
easy, requiring one to apply the route and eheek whether eaeh target is visited by 
its deadline. It requires linear time in the number of targets. 

Hardness. Let us eonsider the Restrieted Hamiltonian Cireuit problem (RHC) 
whieh is known to be A/^P-eomplete. RHC is defined as follows: given a graph 
Gh = {Vh, Eh) and an Hamiltonian path P = (/ii, ..., hn) for Gh sueh that hi G 
Vh and (/ii, hn) ^ Eh, find an Hamiltonian eyele for Gh- From sueh instanee of 
RHC, following the approaeh of [l39l, we build the following instance for LM- 
COV-ROUTE: 

• V = VhU {vi,V2,vt}; 

• T = VhU {ut}; 

• E = EhU {{hn, vt), {hi, Vs) : i e [1,.. ., n}}; 

• d{vt) = n + 1 and d{t) = n for any f e T with t 7 ^ vp 
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Basically, given Gh we introduce three vertices vi,V 2 ,Vt, where vi, V 2 are adjacent 
starting vertices and Vt is a target. We know the covering routes from vi, and we 
aim at finding the covering routes from V 2 . The new starting vertex (^ 2 ) is closer 
to hn-i than the previous one (ui) by 1 and farther from hi than previous one 
(ui) by 1. There is no constraint over the distances between the starting vertices 
and the other targets except that they are larger than or equal to 2. We report in 
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Figure 9; Example of construction used in the proof of Theorem [T^ the Hamiltonian path 
(hi, ^ 2 , ha, hi, ha, hg, hy, hg, hg, hio) on G// is given, as well as covering route ri with ri(0) = 
Vi and T(ri) = T. It can be observed that there is another Hamiltonian path for Gh, i e., 
(hg, hg, hg, hi, hi, hg, hg, hg, hy, hio), allowing covering route rg with r 2 ( 0 ) = ug and r(r 2 ) = 
T. Notice that, if we remove the edge (hg, hg), then covering route rg such that r2(0) = ug and 
’^(rg) = T does not exist. 


Figure]^ an example of the above eonstruetion. Notiee that by eonstruetion, if the 
maximal eovering route r 2 with r2(0) = V 2 and T(r 2 ) = T exists, then Vt must 
be the last visited target. Route ri is eovering sinee {hi,, h^) is a Hamiltonian 
path for Gh- We need to show that route r 2 with r2(0) = V 2 and T{r 2 ) = T exists 
if and only if Gh admits a Hamiltonian cycle. It can observed that, if r 2 exists, 
then it must be such that rg = (ug, hn-i,... ,hn,Vt) and therefore {hn-i,... ,hn) 
must be a Hamiltonian path for Gh- Since we know, by ri, that {hn-i-, hn) G Eh, 
it follows that {hn-i, - - -, hn, hn-i) is a Hamiltonian cycle. This concludes the 
proof. □ 

This shows that iteratively applying Algorithm to SRG-r for each starting 
vertex v and then choosing the vertex with the highest utility is the best we can do 
in the worst case. 

5.3. Summary of results 

We summarize our computational results about Questions 1-4 in Table in¬ 
cluding also results about the resolution of the PG. We use ‘?’ for the problems 
remained open in this paper. 

6. Experimental evaluation 

In this section, we experimentally evaluate our algorithms. We implemented 
our algorithms in MATLAB and we used a 2.33GHz LINUX machine to run our 
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Topology 

Question 

Linear 

Cycle 

Star 

Tree 

Arbitrary 

Question 1 

PP 

PP 

? 

PAfP-hard 

^PP-hard 

Question 2 

P 

P 

P 

ATP-hard 

A/”P-hard 

Question 3 

P 

P 

P 

A/'P-hard 

A/”P-hard 

Question 4 

P 

P 

? 

A/'P-hard 

A/”P-hard 

Question 2 
Reoptimization 

PP 

PP 

7 

7 

A/'P-hard 


Table 1: Computational complexity of discussed questions. 


experiments. For a better analysis, we provide two different experimental evalua¬ 


tions. In Seetion 6.1, we apply our algorithms to worst-ease instances suggested 
by our TV^'P-hardness reduction, in order to evaluate the worst-case performance 
of the algorithms and to investigate experimentally the gap between our AVX- 
hardness result and the theoretical guarantees of our approximation algorithms. In 
Section [6!2| we apply our algorithms to a specific realistic instance we mentioned 
in SectionExpo 2015. 

6.1. Worst-case instances analysis 

We evaluate the scalability of Algorithm and the quality of the solution re¬ 
turned by our approximation algorithms for a set of instances of SRG-u. We do 
not include results on the evaluation of the algorithm to solve completely a PG, 
given that it trivially requires asymptotically \V\ times the effort required by the 
resolution of a single instance of SRG-u. In the next section we describe our ex¬ 


perimental setting, in Section 6.1.2 we provide a quantitative analysis of the exact 


algorithms while in Section 6.1.3 we evaluate the quality of our approximations. 


6.1.1. Setting 

As suggested by the proof of Theorem]^ we can build hard instances for our 
problem from instances of HAMILTONIAN-PATH. More precisely, our worst- 
case instances are characterized by: 


all the vertices are targets, 
edge costs are set to 1, 
there is only one signal. 
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• penetration times are set to |T| — 1, 

• values are drawn from (0,1] with uniform probability for all the targets, 

• the number of edges is drawn from a normal distribution with mean e, said 

edge density and defined as e = and 

• starting vertex v is drawn among the targets of T with uniform probability. 

We explore two parameter dimensions: the number of targets |T| and the value of 
edge density e. In partieular, we use the following values: 


|T| G {6,8,10,12,14,16,20,25,30,35,40,45,50}, 
e e {0.05, 0.10,0.25, 0.50, 0.75,1.00}. 


For eaeh eombination of values of |T| and e, we randomly generate 100 instanees 
with the eonstraint that, if < |T|, we introduee additional edges in order to 
assure the graph eonneetivity. The suitability of our worst-ease analysis is eor- 
roborated by the results obtained with a realistie setting (see Seetion 6.2) whieh 
present hard subproblems eharaeterized by the features listed above. 


6.1.2. Exact algorithms scalability 

We report in Figure[T^the eompute time (averaged over 100 SRG-u instanees) 
required by our exaet dynamie programming algorithm (Algorithm[^, with the an¬ 
notation of dominated (proper) eovering sets and the generation of the routes, as 
|T| and e vary. We report in Appendix B the boxplots showing the statistieal sig- 
nifieanee of the results. It ean be observed that the eompute times are exponential 
in |T|, the eurves being lines in a semilogarithmie plot, and the value of e deter¬ 
mines the slope of the line. Notiee that with e G {0.05,0.10,0.25} the number 
of edges is almost the same when |T| < 16 due to the eonstraint of eonneetiv- 
ity of the graph, leading thus to the same eompute times. Beyond 16 targets, the 
eompute times of our exaet dynamie programming algorithm are exeessively long 
(with only e = 0.25, the eompute time when |T| = 20 is lower thanlO^ seeonds). 
Interestingly, the eompute time monotonieally deereases as e deereases. This is 
thanks to the faet that the number of proper eovering sets dramatieally reduees as 
e reduees and that Algorithm [^enumerates only the proper eovering sets. 

We do not report any plot of the eompute times of our exaet braneh-and-bound 
algorithm, sinee it requires more than 10^ seeonds when |T| > 8 even with e = 
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Figure 10: Compute times in seconds of our exact dynamic programming algorithm (Algorithm[2l, 
with the annotation of dominated (proper) covering sets and the generation of the routes, as |T| 
and e vary. 

0.25, resulting thus non-applicable in practice. This is because the branch-and- 
bound algorithm has a complexity 0(|T|l^l), while the dynamic programming 
algorithm has a complexity 0(2l^l). 

Figure shows the impact of discarding dominated actions from the game 
when e = 0.25. It depicts the trend of some performance ratios for different met¬ 
rics. We shall call Q the complete game including all "D’s dominated actions and 
Qr the reduced game; CCS will denote the full version of Algorithm and LP 
will denote the linear program to solve SRG-n. Each instance is solved for a ran¬ 
dom starting vertex v\ we report average ratios for 100 instances, “n. covsets” is 
the ratio between the number of covering sets in and in Q. Dominated actions 
constitute a large percentage, increasing with the number of targets. This result 
indicates that the structure of the problem exhibits a non-negligible degree of re¬ 
dundancy. LP times (iterations) report the ratio between and Q for the time 
(iterations) required to solve the maxmin linear program. A relative gain directly 
proportional to the percentage of dominated covering sets is observable (LP has 
less variables and constraints). A similar trend is not visible when considering the 
same ratio for the total time, which includes CCS. Indeed, the time needed by CCS 
largely exceed LP’s and removal of dominated actions determines a polynomial 
additional cost, which can be seen in the slightly increasing trend of the curve. 
The relative gap between LP and CCS compute times can be assessed by look- 
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ing at the LP/CCS curve: when more targets are considered the time taken by LP 
is negligible w.r.t. CCS’s. This shows that removing dominated actions is useful, 
allowing a small improvement in the average case, and assuring an exponential 
improvement in the worst case. 



—e— LP (time) 

X LP (n. iterations) 
-+- LP + CCS (time) 

- * - n. covsets 
□ LP/CCS (time G^) 

<> LP/CCS (time G) 


Figure 11; Ratios evaluating dominances with e = 0.25 as |T| varies. 

Figure [T^ shows the game value for V, 1 — g^, as |T| and e vary (averaged 
over 100 instances). It can be observed that the game value is almost constant as 
|T| varies for e G {0.05, 0.10, 0.25} and it is about 0.87. This is because all these 
instances have a similar number of edges, very close to the minimum number 
necessary for having connected graphs. With a larger number of edges, the game 
value increases. Interestingly, fixed a value of e, there is a threshold of |T| such 
that beyond the threshold the game value increases as |T| increases. This suggests 
that the minimum game value is obtained for connected graphs with the minimum 
number of edges. 

In Tab. we report compute times with multiple signals, where the targets 
covered by a signal and the probability that a target triggers a signal are randomly 
chosen according to a uniform distribution. Values are averages over 100 random 
instances and give insights on the computation effort along the considered dimen¬ 
sions. The results show that the problem is computationally challenging even for 
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Figure 12; Optimal game values as |r| and e vary, 
a small number of targets and signals. 


m 

5 

10 

15 

2 

- 

17.83 

510.61 

3 

- 

33.00 

769.30 

4 

0.55 

35.35 

1066.76 

5 

0.72 

52.43 

1373.32 


Table 2: Compute times (in seconds) for multi-signal instances. 


6.1.3. Approximation algorithms 

We evaluate the actual approximation ratios obtained with our approximation 
algorithms as (1 — /(I — where is the expected utility of A at the equi¬ 

librium considering all the covering sets and % is the expected utility of A at the 
equilibrium when covering sets are generated by our heuristic algorithm. We exe¬ 
cute our approximation dynamic programming algorithm with a different number, 
say RandRes, of randomly generated orders from {10,20,30,40,50}, in addi¬ 
tion to the 3 heuristics discussed in Section 3.2.2 We executed our approximation 
branch and bound algorithm with constant values of p from (0.25, 0.50, 0.75,1.00} 
(we recall that with p = 1.00 backtracking is completely disabled). 
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Figure and Figure [^report the aetual approximation ratios (averaged over 
100 instanees) obtained with our approximation algorithms for different values 
of |T| G {6, 8,10,12,14,16}, i.e., the instanees for whieh we know the optimal 
game value, and e G {0.05, 0.10, 0.25, 0.50,0.75,1.00}. We remark that the ra¬ 
tios obtained with the approximation braneh-and-bound algorithm for some val¬ 
ues of p are omitted. This is beeause the eompute time needed by the algorithm 
is over 10^ seeonds. The algorithm always terminates by the deadline for only 
pG (0.75,1.00}. We foeus on the ratios obtained with the dynamie programming 
algorithm. Given a value of e, as |T| inereases, the ratio deereases up to a given 
threshold of |T| and then it is a eonstant. The threshold inereases as e deereases, 
while the eonstant deereases as e deereases. The value of the eonstant is high for 
every e, being larger than 0.8. Although the ratios inerease as RandRes inereases, 
it is worth noting that the inerease is not very signifieant, being of the order of 
0.05 between 10 RandRes and 50 RandRes. We foeus on the ratios obtained with 
the braneh-and-bound algorithm. Given a value of e, as |T| inereases, the ratio 
deereases up to a given threshold of |T| and then it inereases approaehing a ratio 
of 1. The threshold inereases as e deereases, while the minimum ratio deereases 
as e deereases. Interestingly, ratios with p = 1.00 are very elose to ratios with 
p G 0.75, showing that performing even signifieant baektraeking around the so¬ 
lution found with p = 1.00 does not lead to a signifieant improvement of the 
solution. The solution ean be effeetively improved only with p = 0.25, but it is 
not affordable due to the exeessive required eompute time. This shows that the 
heuristie performs very well. Comparing the ratios of the two algorithms, it ean 
be observed that the approximation dynamie programming algorithm performs 
better than the approximation braneh-and-bound algorithm. While the dynamie 
programming one always provides a ratio larger than 0.8, the braneh-and-bound 
one provides for eombinations of |T| and e ratios lower than 0.4. 

Figure [^reports the game values obtained with the approximation dynamie 
programming algorithm for every value of RandRes and with the approximation 
braneh-and-bound algorithm when |T| G (20, 25, 30,35,40,45, 50} only for p = 
1.00. Indeed, with p = 0.75 the eompute time is exeessive and, as shown above, 
the purely heuristie solution eannot be signifieantly improved for e > 0.75. We 
report experimental results only for e G {0.05,0.25}. We notiee that for these 
instanees we do not have the optimal game value. However, sinee the optimal 
game value eannot be larger than 1 by eonstruetion of the instanees, the game 
value obtained with our approximation algorithms represents a lower bound to the 
aetual approximation ratio. It ean be observed that, given a value of e, the ratios 
obtained with the dynamie programming algorithm are essentially eonstant as |T| 
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e = 0.05 


e = 0.25 




Figure 15: Game values as |T| varies. 


increases and this eonstant redueed as e reduees. Surprisingly, after a eertain value 
of |T|, the game values obtained with the braneh and bound algorithm are higher 
than those obtained with the dynamie programming algorithm. This is beeause, 
fixed a value of e, as |T| inereases, the problem beeomes easier and the heuristie 
used by the braneh and bound algorithm performs well finding the best eovering 
routes. This shows that there is not an algorithm outperforming the other for every 
eombination of parameters |T| and e. Furthermore, the above result shows that 
the worst oases for the approximation algorithms are those in whioh e = 0(|7^), 
oorresponding to instanoes in whioh the number of edges per vertex is a constant 
in Ir|. It is not clear from our experimental analysis whether inoreasing |T| with 
e = ||^ for some u > 1 the game value approaehes to 0 or to a strictly positive 
value. However, our approximation algorithms provide a very good approximation 
even with a large number of targets and a small value of e. 

Figure [^reports the oompute times required by the approximation dynamie 
programming algorithms. As it ean be seen, the required time slightly inereases 
when adopting a larger number of randomly generated orders with respeet to the 
baseline with p = 1.00. 


6.2. Real case study 

In this seetion we present the results obtained by applying our approaeh to a 
real ease study, in order to show an example of real applieation of our model. We 
imagine to faee the task of proteeting a fair site as we already diseussed in See¬ 
tion 1.1.2 and we foeus on the partieular setting of Expo 2015. Figure [TV] shows 
the map of the Expo 2015 site together with its graph representation. We manually 
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Figure 16: Time ratios as |T| varies. 


build a discretized version of the site map by exploiting publiely available knowl¬ 
edge of the evenj^ We identify ~ 170 sensible loeations whieh eorrespond to an 
equal number of targets in our graph. More speeifieally, we identify ~ 130 targets 
loeated at the entranees of eaeh pavilion and in the surroundings of those areas 
whieh eould be of interest for a high number of visitors. Some targets (~ 35) are 
loeated over the main roads, being these eritieal mainly due to their high erowd. 
Such roads also define our set of edges whieh resulted in a density of ~ 0.02. 
Figure [Tt] reports a graphical representation of chosen deadlines d{-) and values 
7r(-), respectively. To determine such values in a reasonable way we apply a num¬ 
ber of simple rules of thumb. First, to ease our task, we diseretize the spaees of 
possible deadlines and values in four different levels. To assign a value to a target, 
we estimate the interest (in terms of popularity and expeeted erowd) of the eorre- 
sponding area in the fair site. The higher the interest, the higher the value (aetual 
values are reported in the figure). To assign deadlines we estimate the time an 
attacker should spend to eseape from the attaeked target after some malieious ae- 
tivity is started (for example, blending into the erowd without leaving any traee). 
In partieular, we estimate a smaller eseape time for those loeations lying near the 
external border of the fair site while for locations that are more eentral we esti¬ 
mated a larger time. The smaller the eseape time, the tighter the deadline for that 
target. Aetual values are extracted from a normal distribution where cr^ = 1 and 
/i is set aeeording to the chosen level. The maximum distanee between any two 
target loeations is about 1.5Km whieh we assume ean be be eovered in about 7.5 


^Detailed information can be found at http : //www. expo2 015 . org/ 
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minutes (we imagined a erowded seenario). Given sueh referenee seenario, our 
means span from 5 minutes (very tight) to 7.5 minutes (very large). To derive our 
alarm system model we assume to have a number of panoramie eameras deployed 
in the environment at loeations we manually ehoose in order to eover the whole 
environment and to guarantee a wide area of view for eaeh eamera (i.e., trying to 
keep, in general, more than one target under eaeh eamera’s view). To map our set 
of eameras over the alarm system model, we adopt this eonvention: eaeh group of 
eameras sharing an independent partial view of a target t is assoeiated to a signal 
s E S{t); if target t is eovered by k signals then eaeh signal is generated with 
probability 1/k onee t is attaeked. Obviously, a deeper knowledge of the seeurity 
systems deployed on the site ean enable speeifie methods to set the parameters 
of our model. This is why we eneourage involving ageneies in eharge of seeurity 
when dealing with sueh task. 



Figure 18; Best placement and attack locations. 

We first show a qualitative evaluation of our method. Figure depiets the 
best plaeement for the Defender (the eirele in the figure) and the attaeked tar¬ 
gets (the squares in the figure, these are the aetions played by the Attaeker with 
non-null probability at the equilibrium). As intuition would suggest, the best lo- 
eation from where any signal response should start is a eentral one w.r.t. the whole 
fair site. Our simulations show that the optimal patrolling strategy eoineides with 
sueh fixed plaeement even under false negatives rates of at least 0.3. Notiee 
that sueh false negatives value ean be eonsidered unrealistieally pessimistie for 
alarm systems deployed in struetured environment like the one we are dealing 
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Covering set 

Probability 

{19, 20, 23, 25, 125, 126, 127, 128} 

0.0194 

(10, 12, 23, 24, 25, 126, 127, 128} 

0.0231 

(10, 12, 24, 25, 126, 127, 128, 129} 

0.0333 

{12, 23, 24, 25, 126, 127, 128, 129} 

0.0494 

{10, 12, 23, 24, 25, 125, 126, 128} 

0.0344 

{10, 12, 24, 25, 125, 126, 128, 129} 

0.0488 

{10, 12, 25, 125, 126, 127, 128, 129} 

0.0493 

{12, 23, 24, 25, 125, 126, 127, 128} 

0.0502 

{12, 24, 25, 125, 126, 127, 128, 129} 

0.0692 

{19, 20, 23,25, 125, 126, 129} 

0.0492 

{19, 20, 23, 125, 126, 128, 129} 

0.0492 

{20, 23, 25, 126, 127, 128, 129} 

0.0657 

{10, 23,25, 125, 126, 127, 128} 

0.0662 

{19, 20, 23,25, 125, 128, 129} 

0.0412 

{23, 25, 125, 126, 127, 128, 129} 

0.1146 

{20, 23, 24, 25, 127, 128} 

0.0645 

{10, 12, 24, 125, 126, 127} 

0.0877 

{20, 23, 24, 25, 128, 129} 

0.0846 


Figure 19: Example of response strategies to signal. 




with. Attacked targets correspond to areas, which exhibit rather high interest and 
small escape time. Figure reports an example of signal response strategy for 
a given starting vertex (the small circle in the figure) and a given signal (whose 
covered targets are depicted with the large circle in the figure). The table lists the 
computed covering sets and the probabilities with which the Defender plays the 
corresponding covering routes. 



(a) Time boxplots by signal 
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(b) Time boxplots by node 


Figure 20: Time boxplots for our real case study. 


Boxplots of Figure |20(a)| provide some quantitative insights on the computa¬ 
tional effort we measured in solving such realistic instance. Given a signal, we 
report the statistical distribution of the time required by Algorithm to compute 
covering routes from each possible start vertex. In general, we observe a high vari¬ 
ance in each boxplot. Indeed, once fixed a signal s in our realistic instance, it is 
easy to identify starting vertices from which computing covering routes is likely 
to be very easy or, instead, much harder. For the easy case, consider a starting ver¬ 
tex lying very much far away from the group of targets covered by s. In such case. 
Algorithm will soon stop iterating through covering set cardinalities being not 
able to further generate feasible sets. Such feature is induced by the large distance 
of the starting vertex from the targets covered by s together with the low edge 
density and the spatial locality shared among targets covered by the same signal 
(these last two are, indeed, features that frequently recur in realistic scenarios). 
For the harder case, just consider a situation in which the distance of the starting 
vertex from the targets covered by s is such that a large number of covering routes 
is available. An example of this kind can be inspected in Figure Interestingly, 
a similar high variance trend cannot be observed when depicting the statistical 


distribution of the compute time per starting vertex. The boxplot of Figure 20(b) 


suggests that, by fixing the starting vertex and solving for different signals, hard 
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instances counterbalance, on average, the easy ones. 


7. Related works 

In the last few years. Security Games received an increasing interest from the 
Artificial Intelligence scientific community, leading to the exploration of a large 
number of research directions around this topic. In this section, we briefly discuss 
what we deem to be the most significant ones, starting from the game theoretical 
foundations on which these models are built. 

Computing solution concepts is the central problem upon which the real appli¬ 
cability of these game theoretical models is based. A lot of works concentrated on 
algorithmic studies of this topic, analysing the relationships holding among dif¬ 
ferent kinds of solution concepts and their computational complexity. In ||4^ the 
relationship between Stackelberg, Nash and min-max equilibria is studied, while 
in m some refinements of the Stackelberg equilibrium are proposed. Many ef¬ 
forts have been made to develop tractable algorithms for finding Stackelberg equi¬ 
libria in Bayesian games [|4^ . Furthermore, in [|43l the authors analysed scenarios 
in which the Defender has multiple objectives, searching for the Pareto curve of 
the Stackelberg equilibria. 

Besides fundamental works like the ones cited above, a more recent research 
line devoted efforts towards the definition of game model refinements in the at¬ 
tempt to overcome some of their ideal assumptions. One remarkable issue belong¬ 
ing to this scope is how to model the behaviour of the Attacker. In the attempt to 
have a more realistic behaviour, some works considered bounded rationality and 
defined algorithms to deal with it. In ll^ different models of the Attacker are anal¬ 
ysed while in [l45l |46]| the Attacker is allowed to have different observation and 
planning capabilities. Moreover, in WT\ Quantal-Best Response is used to model 
the behaviour of the Attacker and in algorithms that scale up with bounded 
rational adversaries are proposed. In our paper, we assume that the attacker is 
rational. 

Other model refinements focused on those cases in which games exhibit spe¬ 
cific structures that can be leveraged in the design of algorithms to compute the 
Stackelberg equilibrium. For instance, the study of the spread of contagion over 
a network is investigated in [|49||. When no scheduling constraints are present and 
payoffs exhibit a special form, the computation of a Stackelberg equilibrium can 
be done very efficiently enabling the resolution of remarkably big scenarios [l50l . 
In [|5T]| realistic aspects of infrastructures to be protected are taken into account. 
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8. Conclusions and future research 


In this paper we provide the first Security Game for large environments surveil¬ 
lance, e.g. for fair sites protection, that can exploit an alarm system with spatially 
uncertain signals. To monitor and protect large infrastructure such as stations, air¬ 
ports, and cities, a two-level paradigm is commonly adopted: a broad area surveil¬ 
lance phase, where an attack is detected but only approximately localized due to 
the spatially uncertainty of the alarm system, triggers a local investigation phase, 
where guards have to find and clear the attack. Abstracting away from techno¬ 
logical details, we propose a simple model of alarm systems that can be widely 
adopted with every specific technology and we include it in the state-of-art pa¬ 
trolling models, obtaining a new security game model. We show that the problem 
of finding the best patrolling strategy to respond to a given alarm signal is AVX- 
hard with arbitrary graphs even when the game is zero-sum. Then, we provide two 
exponential-time exact algorithms to find the best patrolling strategy to respond 
to a given alarm signal. The first algorithm performs a breath-first search by ex¬ 
ploiting a dynamic programming approach, while the second algorithm performs 
a depth-first approach by exploiting a branch-and-bound approach. We provide 
also a variation of these two algorithms to find an approximate solution. We ex¬ 
perimentally evaluate our exact and approximation algorithms both in worst-case 
instances, to evaluate empirically the gap between our hardness results and the the¬ 
oretical guarantees of our approximation algorithms, and in one realistic instance. 
Expo 2015. The limit of our exact algorithms is about 16 targets with worst- 
case instances while we were able to compute an optimal solution for a realistic 
instance with ^ 170 targets. On the other side, our approximation algorithms pro¬ 
vide a very effective approximation even with worst-case instances. We provide 
also results for special topologies, showing that our dynamic programming algo¬ 
rithm requires polynomial time with linear and cycle graphs, while the problem 
is ATP-hard with tree graphs. Finally, we focus on the problem of patrolling the 
environment, showing that if every target is alarmed and no false positives and 
missed detections are present, then the best patrolling strategy prescribes that the 
patroller stays in a given place waiting for an alarm signal. Furthermore, we show 
that such a strategy may be optimal even for missed detection rates up to 50%. 

Of course, our research does not end here since some problems related to our 
model remain open. The main theoretical issue is the closure of the approximation 
gap of SRG-w. We believe that investigating the relationship between our model 
and the DEADFINE-TSP could help in closing the gap. Another interesting prob¬ 
lem is the study of approximation algorithms for tree graphs. Our A/’P-hardness 
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result does not exclude the existence of a PTAS (i.e., polynomial time approxi¬ 
mation scheme), even if we conjecture that the existence is unlikely. In addition, 
a number of extensions of our model are worth being explored. The most im¬ 
portant extension is to include false positives and missed detections, allowing the 
patroller to patrol even in absence of alarm signals. Other interesting extensions 
regard cases in which the number of patrollers is larger than one or there are 
multiple attackers, which coordinate to perform their malicious attack. Finally, a 
different research direction stemming from the problem concerns the alarm sys¬ 
tem dimension. Indeed, trying to deploy sensors and devices in the environment 
in such a way to maximize the utility in responding to alarms is a non-trivial and 
interesting problem, mainly due to the inherent budget constraint and trade-offs 
that would exhibit. 
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Appendix A. Notation 


We report in Tab. A.3 the symbols used along the paper. 


Appendix B. Additional experimental results 


We report in Fig. B.21 the boxplots of the results depleted in Fig. [T^ They 
show that the varianee of the eompute times drastieally reduees as e inereases. 
This is beeause the number of edges inereases as e inereases and so the number 
of proper eovering sets inereases approaehing On the other hand, with small 
values of e, the number of proper eovering sets of different instanees ean be ex¬ 
tremely different. 
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Symbol 

Meaning 


A 

Attacker 


V 

Defender 


G 

Graph 


V 

Set of vertices 


V 

Vertex 

Ij 

Vi 

i-th vertex 

S 

E 

Set of edges 

_o 

c/3 

{v,v') 

Edge 

cd 

m 

<v' 

Temporal cost (in turns) of the shortest path between v and v’ 


T 

Set of targets 


t 

Target 


ti 

i-th target 


Tr{t) 

Value of target t 


d{t) 

Penetration time of target t 


S 

Set of signals 


s 

Signal 

c/3 

P 

Function specifying the probability of having the system generating 

c 


signal s given that target t has been attacked 

.£P 

T{s) 

Targets having a positive probability of raising s if attacked 


S{t) 

Signals having a positive probability of being raised if t is attacked 


± 

No signals have been generated 


A 

No targets are under attack 


^V,S 

Set of routes starting from vertex v when signal s is generated 


r 

Route 


Vi 

i-th route 


r{i) 

i-th element visited along route r 


UAinAi) 

Attacker’s utility given a route r and a target t 

c/3 

_CJ 


Defender’s strategy 

D 

O 


Defender’s strategy starting from vertex v 



Defender’s strategy starting from vertex v when signal s is generated 



Attacker’s strategy 


(J'^ 

^ V 

Attacker’s strategy when "D is in u 


9v 

Value of the game (utility of A) 


A{r{{)) 

Time needed by V to visit r(i) starting from r(0) 


T{r) 

Set of targets covered by route r 


c(r) 

Temporal cost (in turns) associated to r 


Table A.3: Symbols table. 
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Number of targets 


Number of targets 





Number of targets 


Number of targets 


Figure B.21; Boxplots of compute times required by our exact dynamic programming algorithm. 
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